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The  first  section  presents  some  basic  properties  of  spherical  harmonics,  stres¬ 
sing  their  relationship  to  two-dimensional  Fourier  series.  Algorithms  for  the  eval¬ 
uation  of  the  harmonic  coefficients  by  numerical  quadratures  are  given  here,  and  it 
is^hown  that  the  number  of  operations  is  the  order  of  N3  for  equal  angular  grids, 
where  N  is  the  number  of  lines  of  latitude,  or  *Nyqulst  frequency**,  of  the  grid. 

The  second  section  introduces  a  quadratic  measure  for  the  error  in  the  estima¬ 
tion  6t  the  coefficients  by  linear  techniques.  This  is  the  error  measure  of  least 
squares  collocation,  which  is  a  method  that  can  be  used  for  harmonic  analysis.  Ef¬ 
ficient  algorithms  for  implementing  collocation  on  the  whole  sphere  are  described. 

A  formal  relationship  between  collocation  and  least  squares  adjustment  is  used  to  ob¬ 
tain  an  alternative  form  of  the  collocation  algorithm  that  is  likely  to  be  stable  with 
dense  data  sets  and,  with  a  minor  modification,  can  be  used  to  implement  least 
squares  adjustment  as  well.  The  basic  principle  is  that  for  regular  grids  the  variance* 
covariance  matrix  of  the  data  consists  of  Toepl itz -c i rculant  blocks,  so  it  can  be  both 
set  up  and  inverted  very  efficiently. 

Section  four  describes  a  method  for  computing  the  covariances  between  area 
means  that  is  much  more  efficient  than  the  usual  approach  of  integrating  numerically 
the  point  covariance  function  twice  over  the  blocks.  This  method  Is  essential  for 
setting  up  the  variance-covariance  matrix  of  die  data  to  implement  collocation  when  the 
data  are  area  means.  In  this  and  other  respects  the  present  work  is  complementary 
to  Report  291,  by  the  same  author. 

A  c  "•  C  ' 

Finally,  there  is  an  Appendix  containing  the  listing  of  subroutines  that  imple¬ 
ment  various  algorithms,  together  with  some  explanatory  notes  on  how  to  use  them. 
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1.  Introduction 


Spherical  harmonics  are  closely  associated  with  the  basic  theory  of 
gravitational  and  magnetic  fields,  such  as  those  of  the  Earth  and  planets; 
for  this  reason  they  are  important  both  in  geodesy  and  in  Earth  and  planetary 
physics. 

The  present  work  considers  the  numerical  aspects  of  the  reduction  of  global 
data  sets  to  spherical  harmonic  coefficients,  so  the  emphasis  has  been  laid  on  the 
algorithms  for  this  purpose.  The  procedures  for  harmonic  analysis  (and  synthesis! 
given  here  are  general  enough  to  be  used  in  the  study  of  magnetic  or  electric  fields, 
but  most  conclusions  regarding  their  accuracy  are  restricted  to  the  gravity  field 
of  our  planet  and  to  fields  with  the  same  power  spectrum.  The  accuracy  of  these 
methods  cannot  be  separated  from  the  type  of  signal  being  used. 

Modern  instrumentation  has  provided  scientists  and  engineers  with  vast 
amounts  of  information,  and  modern  computers  have  made  the  processing  of  it 
possible,  and  even  routine,  thanks  to  constant  improvements  in  both  hardware 
and  software.  In  the  mid  sixties,  those  branches  of  applied  mathematics,  physics, 
and  engineering  concerned  with  the  sifting  of  data,  or  with  the  study  of  very  large 
regular  structures,  were  greatly  affected  by  the  advent  of  the  Fast  Four.er  Trans¬ 
form  (FFT).  In  spite  of  the  fact  that  spherical  harmonics  are  members  of  the 
family  of  Fourier  transforms,  closely  related  to  two  dimensional  Fourier  series, 
geodesy  has  lagged  behind  in  the  development  of  techniques  similar  to  the  FFT, 
partly  because  of  the  rather  wicked  nature  of  the  sphere  on  which  data  are  usually 
given,  partly  because  there  have  not  been  enough  data  to  make  the  development  of 
powerful  techniques  a  general  concern.  The  topological  differences  between  the 
Euclidean  plane  and  the  surface  of  the  sphere  may  very  well  prevent  the  finding 
of  algorithms  as  efficient  as  the  FFT  for  the  latter  (certainly  none  seems  to  have 
been  reported  to  date)  but  such  algorithms  should  be  regarded,  nonetheless,  as 
the  desideratum  for  all  those  who  wish  to  put  their  time  and  work  into  developing 
good  numerical  methods  for  spherical  harmonic  analysis. 

The  increasing  use  of  artificial  satellites  for  surveying  the  gravity  field, 
particularly  by  radar  altimetry  and  by  the  projected  tracking  of  one  satellite  by 
another,  are  making  the  use  of  very  efficient  and  accurate  techniques  for  handling 
the  resulting  data  imprescindible;  even  a  casual  review  of  the  literature  of  the 
last  few  years  will  show  that  serious  efforts  to  provide  such  techniques  are  getting 
under  way.  The  days  of  scarce,  scattered,  unreliable  data  are  just  about  over. 

The  remainder  of  this  section  defines  the  basic  problem  and  the  associated 
notation,  shows  the  relationships  between  spherical  harmonics  and  2D- Fourier 
series,  presents  some  of  the  similitudes  and  differences  between  both,  and  ex¬ 
plains  some  efficient  algorithms  for  harmonic  analysis  and  synthesis  that  are 
common  to  a  number  of  different  problems.  Section  2  begins  by  defining  a 
quadratic  measure  for  the  accuracy  of  the  estimated  harmonic  coefficients  based 
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on  the  covariance  functions  of  the  signal  and  the  noise;  a  discussion  on  the 
optimization  of  this  measure  follows,  leading  to  the  application  of  least  squares 
collocation  to  harmonic  analysis.  The  use  and  implementation  of  least  squares 
adjustment  follows,  and  then  a  discussion  of  the  connection  between  least  squares 
and  least  squares  collocation  ,  shown  as  alternative  and  efficient  techniques 
for  solving  the  same  problem.  The  section  closes  with  an  algorithm  for  the  case 
when  data  are  Irregularly  distributed.  Section  3  illustrates  with  several 
numerical  examples  some  of  the  methods  presented  earlier.  Section  4 
introduces  an  efficient  formula  for  computing  the  covariances  between  block 
averages  when  analyzing  mean  values  by  collocation.  This  formula  is  much 
more  efficient  than  others  based  on  the  numerical  quadratures  of  the  "point" 
covariance  function. 


1.1.  Spherical  Harmonic  Analysis  and  Synthesis;  Definitions 


A  square  integrable,  analytical  function  f(0,A)  defined  on  the  unit  sphere 
0  s  S  s  tt  and  0  £  A  <  2  tt  can  be  expanded  in  a  series  of  surface  spherical 
harmonics 


f(0,X) 


L  Pn,(cos0)  [  C„B  cos  mA  +  Sni 


■  =  0 


sinmA  ] 


(1.1) 


where:  PnI  are  the  associated  Legendre  functions  of  the  first  kind,  fully 

normalized  so -i—.  f  PB,  (cos  0)3(cos>\  ml  da  =  1  (here  f  da 
4rrJo  \Sin/  Jo 

indicates  integration  on  the  unit  sphere); 

Cal,Sn,  are  the  fully  normalized  spherical  harmonic  coefficients. 


For  the  sake  of  brevity,  the  following  alternative  notation  shall  be  used  when  possible 


-a  a  ,  _  rP„(cos  0)cos  mA  if  a  =  0 

n»  >  )  l"Pni (cos  0)  sinmA  if  a  =  1 

and 

Of  _  [C„  if  Oi  a  0 

u‘  "  is„.  if  a=i 

~ “  /y 

The  purpose  of  spherical  harmonic  analysis  is  to  estimate  the  coefficients  Cn, 
from  measurements  of  the  signal  f  (  0 ,  A).  These  measurements,  which  may  be 
corrupted  by  some  noise  or  error  signal  "n",  and  which  are  assumed  in  what 
follows  to  be  finite  in  number,  constitute  the  data.  The  individual  samples  are 

called  z13,  so  ztJ  =  f ( 0t ,  A,)  +  ntj.  The  subscripts  i  and  j  are  used  only  to 

designate  the  position  of  the  sample  in  a  two-dimensional  array,  or  grid,  covering 
in  some  more  or  less  regular  way  the  sphere:  i  corresponds  always  to  (co)latitude, 
and  j  to  longitude.  While,  as  in  the  last  paragraph  of  section  3,  some  places  in 
the  grid  may  be  empty,  the  grid  itself  is  defined  by  a  set  of  complete  parallels 
and  meridians  .  Unless  otherwise  specified,  the  separation  between  the  lines  of 
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latitude  can  be  variable,  but  that  between  meridians  is  always  constant  and 
equal  to  AX  =  -r/N,  where  N  is  an  integer.  The  grid  most  often  consi¬ 
dered  in  this  work  is  the  equal  angular  grid,  also  called  the  regular  grid; 
in  this  case  A0  is  also  constant  and  equal  to  A  A  . 

For  equal  angular  grids  i  and  j  take  values  0  £  i  £  N-l  and  0  ^  j  £2N-1, 
i  increasing  from  North  to  South,  and  j  from  West  to  East.  Formulas  where  the 
i,j  subscripts  appear  are  in  the  form  appropriate  to  the  regular  grid,  though  most 
of  them  can  be  extended  in  a  very  simple  way  to  other  partitions. 

Data  may  consist  of  values  determined  at  the  intersections  of  the  grid,  in 
which  case  they  are  referred  to  as  "point  data",  or  they  may  be  averages  over 
the  blocks  defined  by  the  lines  of  the  grid,  and  then  they  are  called  "area  means" 
or  1 'block  means".  In  the  equal  angular  grid  the  northernmost  and  southernmost 
blocks  reach  to  the  respective  poles;  there  are  N  rows  of  blocks  (i.e.  blocks  be¬ 
tween  the  same  parallels),  there  are  2N  blocks  per  row,  and  i,j  identify 
olocks  according  to  the  row  and  column  they  are  in.  Grids  for  equal  angular 
point  data  may  be  "center  point"  grids,  data  being  measured  at  the  center  of 
each  block,  so  i  =  0  corresponds  to  9  =  A  =  AX/2. 


No  blocks  stride  the  equator,  so  regular  grids  are  symmetrical  with  respect 
to  it;  in  other  words;  N  is  always  even  (extension  to  N  odd  is  trivial). 


Area  means  are  identified  here  by  overbars;  they  can  be  expanded  in  series 
simply  by  integrating  (1. 1)  term  by  term,  which  can  be  done  because  the  spherical 
harmonic  series  is  always  uniformly  convergent  for  9  s  0  5  n  and  0  £  A  £  2  ^  : 

fu  =  ~  L  t.  t?*  J  y“(0, A)  do¬ 
ff  s  1 


'!  3  n  =  0  «  =  0  0=0 

os  r.  1 


=  •  £  £  £  P!  'i9Pn«(cos  Q )  sin 9  dS  [cn,  cosmA 

Ajj  „  =  0  «=o  0=0  J0i  L  "'A. 

+  S0B ^sinmAdxJ  (1.2) 

Here  Ti3  is  the  area  mean  of  f(  0,A)  on  the  block  ff.3  whose  area  is 
AtJ  =  AX(cos  St  -  COS(  9;  +  A0)  )  . 


1  if  at  =  5 ,  m  =  p  and  n  =  k 
0  otherwise 


A  basic  property  of  spherical  harmonics  is  their  orthogonality: 
ja  y“(9,A)  Yk?(6,A)  da  =  { 
as  a  consequence  of  which 

C„“  =  T“(6,A)  f(0,A)  dff 


(1.3) 


(1.4) 
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Expression  (1.4)  is  the  inverse  of  (1.1);  both  constitute  the  basis  of  spherical 
harmonic  analysis.  In  general,  (1.4)  cannot  be  calculated  analytically,  because 
what  is  available  is  not  the  function  f  (  A),  but  a  finite  set  of  noisy  measure¬ 

ments  in  the  form  of  point  values  zl}  or  their  area  means  zj^.  Discretizing 
(1. 4)  on  an  equal  angular  grid  results,  for  instance,  in  the  following  numerical 
quadratures  formula; 


A 

C 


a 

n» 


1 

4tt 


N-l  3N-1 

Z  Z  Yn^SfAj)  f(  a1(X4) 

i  =  0  J  =  0 


Aij 


(1.5) 


a  ci  _a 

where  Cn«  indicates  the  estimate  of  Cna.  as  this  type  of  formula  is  usually  only 
an  approximation.  Formulas  resembling  (1.5)  can  be  called  "point  values-type 
quadrature  formulas’,’  and  can  be  handled  by  algorithms  that  are  all  identical  from 
a  structural  point  of  view  and  whose  prototype  is  that  of  paragraph  1.5. 


If  the  data  are  block  averages,  several  simple  approximations  have  been 
proposed  that  take  the  form 

N  — 1  aN-1 


Cn«  [An  Z  Z  ^1J  J  Yna(6,  A)  do 

1=0  J  =0  Jffij 


where  the  nn  are  scale  factors.  This  kind  of  formula  shall  be  studied  further 
in  section  2.  Writing  the  above  expression  in  full 

v’v'1-  re‘*^  rx’*aVoS1 

c„.  "  <*•  P..(cos9)  .msaej  isin-fn,XdA  <>• 

This  belongs  to  the  general  type 

N  — 1  SN-l 


G) 


6?.  -  *  e  x?  + ltjh-h  <i-7> 


.vhere  B(m)  =  | 


f(cosm  AA-l)/m  if  m^O 


if  m  =0  ’  A<m>  =l 


_  f(sin  m  AA/m  if  m  /  o 


AA 


if  m  =  0 


Formulas  resembling  (1.6)  or  (1.  7)  appear  several  times  in  this  work,  and  are 
called  here  area  means-type  quadrature  formulas. 


-.O' 

If  all  C 


the  (Nmax  +  if  terms  in 
f(0,X)Wmax 


»«  with  0  *  n  £  Nmax 


-  Z  Z  Zc 

n  =  0  "  =  0  Ot  -  0 


are  known,  they  can  be  used  to  compute 
ns  Y.®  (6,  A)  (1.8) 


which  can  be  regarded  as  an  approximation  to  f  (  6  ,  A )  at  the  point  (  9  ,  A )  . 
Expression  (1.8),  together  with  the  truncation  at  Nmax  of  (1.2)  (area  means), 
defines  the  object  of  spherical  harmonic  synthesis;  given  the  coefficients, 
estimate  the  function.  As  shown  later,  analysis  and  synthesis  are  related  by 
a  simple  duality,  so  they  require  about  the  same  number  of  operations  when 
performed  on  a  certain  grid  and  with  a  given  number  of  coefficients. 
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(1.9) 


The  set  of  all  degree  variances 

=  e„  cBl2  +  sa/ 

constitutes  the  power  spectrum  of  f(0,A).  If  all  coefficients  of  degree  n  >  Nmax 
are  zero,  so  are  the  corresponding  <Jn  ;  insuchcase  the  function  f(  9,X)  is  said 
to  be  band  limited. 


Closely  associated  with  the  power  spectrum  is  the  isotropic  covariance 
function 

CD 

cov ( f(P) ,  f  (Q)  )  =  E  CTna  Pn(COS  )  (1.10) 

n  =  0 

Here  p  and  Q  are  two  points  on  the  sphere,  fa  =  cos  cos  9?  cos  9q  sin  9p 
x  sinSij  cos(  k  -  \q)  ]  is  the  geocentric  angle  or  spherical  distance  between  both, 
and  Pn(cos  v)  is  the  unnormalized  Legendre  polynomial  of  degree  n .  The  power 
spectrum  and  the  covariance  of  a  function  on  the  sphere  ,  like  those  of  func¬ 
tions  defined  on  the  real  line,  on  the  plane, and  on  Euclidean  spaces  of  higher 
dimensions,  can  be  used  to  obtain  estimators  related  to  discrete  Wiener 
filters  and  predictors,  i.  e.,  satisfying  a  minimum  variance  condition.  The 
spherical  version  of  such  technique  is  known  as  least  squares  collocation  (V.ontz, 
1972).  The  inverse  of  expression  (1. 10)  is 

aa3  =  (2n  +  l)  Jo  Pa(cos  ijjpq)  cov(f(P),f(Q)  )  sin  d^  (1.11) 

Equations  (1.10)  and  (1.11)  show  that,  as  usual,  power  spectrum  and  covariance 
function  are  linear  transforms  of  each  other.  A  formula  such  as  (1. 11)  is  some¬ 
times  called  a  Legendre  transform. 


In  addition  to  the  autocovariance  (1.10),  one  can  define,  more  generally, 

CO 

cov(u(P),v(Q)  )  =  Yj  ca’r  Pa(cos  ihpg)  (1-1 0*) 

n  ss  0 

as  the  covariance  of  two  functions  u(0p,  Xp  )  and  v(  dQ  ,  Xq)  on  the  sphere,  where 
the  a 


c."  =  E  Caa,  c;  +  SBU.  Sl.  ,  n  =  0, 1, . . .  (1.9 

«= o 


U,v 


*\ 


constitute  the  "power  crosspectrum”. 

A  relationship  similar  to  (1.11)  applies  to  the  c Y  and  to  cov (u (P),  v(Q) ). 

A  very  important  property  of  spherical  harmonics  is  Parseval's  theorem: 

-tluB.Xf  da  -  E  (1.12, 
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The  orthogonality  of  spherical  harmonics,  and  the  fact  that  they  form  a 
complete  orthogonal  set  of  functions  on  the  sphere,  are  among  the  reasons  why 
they  are  used  so  widely  in  theory  and  in  practice,  but  there  is  more  to  them  than 
orthogonality. 


The  solid  spherical  harmonics 


-^tT“(0,X)  and  raY®(6,X) 


where  r  is  the  distance  to  the  origin  of  coordinates,  are  all  solutions  of  Laplace's 
equation, which  in  Cartesian  coordinates  is 


vaw 


...  a3w  A  a3w  . 

“  *T  +  a?- 


s3w 

3za 


=  0 


(1.  13) 


This  makes  them  appropriate  for  the  study  of  harmonic  functions,  such  as  the 
gravitational  potential,  in  spherical  coordinates.  Another  property  they  have, 
unique  among  functions  that  are  orthogonal  on  the  sphere,  is  the  relationship 


[*  f (P)  E  <2n  +  l)k*  da  =  £  £  £kBc"Y“(Q)  (1.14) 

4  TT  JC7  j  “  q  n  q  —  o  &  ss  o 

which  corresponds  to  the  Convolution  Theorem  for  ordinary  Fourier  series*  and  is 
the  basis  of  such  fundamental  formulas  as  Stokes’  in  gravimetric  geodesy. 


1.2.  Relationship  Between  Spherical  Harmonics  and  2-p  Fourier  Series 


As  indicated  in  the  previous  paragraph,  spherical  harmonics  share  important 
properties  with  ordinary  (trigonometric)  Fourier  series  in  one  or  more  dimensions. 
There  is  also  a  very  immediate  relationship  with  ordinary  two-dimensional  (2-D) 
series  that  will  be  explained  here.  From  Hobson  (1931,  Ch.  HI,  formula  (7))  we 
know  that 


Pa,(C08  Q) 


-  (rl)"  (2n)l 
2*  n!  (n-  m) ! 


sin'8  (cos— 8- 

l  2  ( 2n  - 1) 


cos 


8  + 


so  the  normalized 


(n -  m). ,  ■  (n  -  m  -  3) 
*  2*  4  •  (2n-l)(2n-  3) 

Legendre  function  is 


cos 


(1.15) 


Pnl(cos  8) 

where 

L(n,  m)  = 


=  (-D  (2nl) 

2“  n!  (n  - m) ! 

r  (n-m)/2  if 
l  (n  -  m  - 1)  /2  if 


5-1  -5* 

ak(n,  m)  cos  Q 


n-m  is  even 
n-m  Is  odd 


while  a„  (n,  m) 


(-l)v  (n-m)(n-m-l).  ..(n-m-2k  +  l)[2  •  4. . . 2k(2n - 1). . . (2n-2k+l)]_l 

for  k  >  0 
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In  the  interval  -tt  <  9  s  n  sin*  0  is  even  when  m  is  even,  and  odd  when 
m  is  odd.  In  the  interval  -n  s  e  £  ?r  the  sum  cos  a 

k  =  0 

is  always  even  (it  is  a  sum  of  even  powers  of  cos  9 ),  so  the  parity  of 
Pa  (cos  9)  is  the  same  as  that  of  m  if  -tt  s  9  £  r .  An  even  function  can 
be  expanded  into  a  sum  of  cos ines.  and  an  odd  function  into  a  sum  of  sines. 
The  highest  frequency  term  will  correspond  to  the  highest  frequency  in  the 
expansion  of  sin®  9  cos’*®  9,  so  this  term  will  be  of  the  form  a*  cosne  or 
b»  sinn0.  Therefore,  the  Legendre  function  satisfies  one  of  the  following 
equations  in  -tt  ^  9  £  n : 

a 

Pea  (cos  9)  =  £  C\  cos  t9  (a)  if  m  is  even; 

t  =0 

_  a 

Pa  a  (cos  9)  =  £  sV  sint9  (b)  if  m  is  odd 

t  =  o 

-a 

A  spherical  harmonic  Ya  (9, A)  can  have  one  of  four  possible  forms: 


for  m  even  < 


—Of 

X. 


<9,  A)  - 


for  m  odd: 

_a 

YB. 


n 

1 

■  tssO 


(9  A)  = 


,L 

a 

z 


C  4  cos  tQ 

cos  mX 

for 

a  =  0 

cVeos  te 

sinmX 

for 

II 

C**  sint0 

cos  mA 

for 

a  =  0 

n  n 

ct  sinte 

sinmX 

for 

p 

11 

M 

(1.16a) 


(1.16b) 


A  sum  of  spherical  harmonics  such  as  (1. 8)  is  equivalent  to  a  sum  of  terms 
of  the  form  cftB  state  cosmA,  C*Bsintg  sinmX,  C*ncoste  cosmA  and 
C?tn  costs  sinmX,  which  are  also  the  basic  functions  of  2-D  Fourier  series. 

The  highest  m  and  n  in  the  spherical  harmonic  expansion 

"•«  a  I  _  _q, 

f  (9, A)  =  Z  I  L 

are  equal  to  Na.  «.  so  the  highest  degree  and  order  (or  spatial  frequencies) 
in  the  Fourier  series  are  also  equal  to  Nn,x .  In  conclusion:  every  surface 
spherical  harmonic  expansion  where  the  highest  degree  is  N,„  is  identical 
to  a  2-D  Fourier  series  (where  the  highest  m  and  n  are  also  Nb„), 
in  the  domain  -tt  M  s  n,  0  s  X  s  2tt  .  The  converse  is  not  true,  be¬ 
cause  continuous  functions  on  a  sphere,  such  as  the  YbD,  must  satisfy  certain 
conditions  at  the  poles  that  ordinary  functions  on  the  -it  s  9  s  n  ,  0  s  \  <:  2n 
domain  do  not  have  to.  Spherical  harmonics  correspond  to  a  subclass 
(linear  subspace)  of  2-D  Fourier  series. 


For  example: 


(Heiskanen  and  Moritz,  Chapter  1, 1967) 
Pn  (cose)  =  /l  sin0 

(cose)  =/30  sinQ  cos8  =  -§^0  sin26 


So 


Yn  =  /3  sine  cosX,  Yux  =/3  sinS  sinX 
Yai=  2^0  sin20  cos2X,  0  sin20  sin2X 

Calling  the 2  -D  Fourier  coefficients  where 


a?- 

correspond 

to 

terms 

of  the 

form 

cos  p9  cos  raX 

aj. 

it 

tf 

tt 

t!  t! 

tt 

cosp9  sinmX 

a?« 

tr 

»t 

ft 

tf  tt 

tf 

sinpS  cos  mX 

it 

tt 

tt 

tt  tt 

tt 

sinp9  sinmX 

these  can  be  related  to  the  respective  Cn»  by  expressions  of  the  form 

C?.  «  T  £  In,p  af«  n=m,  m  +  1,  m  + 

»  (n+m)  !  p=o 


2,  .  .  . 

(1.17) 


where  (3  =  a  if  m  is  even,  and  j3  =  0£+  2  if  m  is  odd.  The  I“t  p  are 
defined  as 


Ta 


n,  p 


In 


cos  p0  Pn,  (cos  8 )  sin  Q  d  e 
sinpG  Pi* (cos  6)  sin  6  d0 


if  m  is  even 


if  m  is  odd. 


and  can  be  computed  recursively  using  the  formula 
2n  - 


I. 


*"-p  ~  2(n 
with  the  following  starting  values 


-1  fT*  .  T*  1  n  +  m  -  1 

-  m)  i  • p+  j  n  -  m 


Tn-  a,  p 


(  o  if  (m  +  p)  is  odd 


1  a,  p 


(1.18a) 


(1. 18b) 


2(m+  1)  (2m)  ! 


if  m  is 


2"  [(m  +  l)2  -  p3]  [(m  -  l)3  -  p3].  .  .[3s  -  p2]  [la  -  p2]  even,  p  even, 

_ 2  fm-t-  1^  12m1)  i _ _  if  m  is  odd, 

2*0m+  if  -  1?]  [<m  -  l)3  -  p2  ]  .  .  .  \f  -  p3]  [-p]  p  odd. 


The  l"&p  are  zero  for  alternate  values  of  both  p  and  m.  These  equations 
were  reported  by  Ricardi  and  Burrows  (1972),  and  show  how  to  obtain  the 


—  Cx  a 

C«  a  once  the  a  have  been  computed  from  the  data  by  means  of  the  2-D 
discrete  Fourier  transform.  Normally  this  would  be  impossible,  because 
the  data  only  exists  in  the  upper  half  of  the  -it  s  9  s  -  ,  0  s  X  ^  2rr 
interval ,  and  the _3-D  algorithm  requires  information  on  all  of  it.  However, 
the  fact  that  the  P„ ,  (cos  9)  are  sums  of  sines  only  or  cosines  onlj'  makes 
the  calculation  possible. 

While  the  maximum  degree  and  order  in  a  2-D  Fourier  series  do  not 
reach  the  Nyquist  frequency1  (m,  n  <  N  =  in  an  equal  angular  grid, 

all  the  coefficients  can  be  recovered  exactly  by  solving  (2N)2  equations 
such  as 


KM  3 

EE  z 

laO  i=9  B  =  o 


)COS 

jsin 


mX , 


(1. 19) 


When  n  or  m  exceed  NS(lX,  the  matrix  of  the  system  of  e  quatlons 
becomes  singular,  and  the  discrete  Fourier  trans  form  consists 
of  coefficients  that  "fit"  the  data,  but  differ  from  the  true  coefficients.  The 
estimated  coefficients  are  said  to  have  been  aliased  with  those  that  exceed 
the  Nyquist  frequency.  In  the  case  of  spherical  harmonics,  which  are  a 
special  case  of  2-D  Fourier  series,  a  similar  situation  must  arise:  the  har¬ 
monics  in  the  data  with  n  s  N  are  going  to  be  aliased  with  those  of  lower 
degree,  so  the  information  available  is  not  enough  to  recover  all  coefficients 
because  the  sampling  is  too  coarse. 

The  aliasing  of  spherical  harmonics  sampled  on  regular  grids  is  a 
consequence  of  the  aliasing  of  the  respective  2-D  Fourier  series,  so  it 
makes  sense  to  talk  of  a  "Nyquist  frequency"  in  the  case  of  those  functions. 
Having  established  the  connection  between  aliasing  in  both  types  of  series, 
it  is  time  to  point  out  also  some  Important  differences. 


1.3  Sampling  Errors 

Expressions  (1.16a  b)  shows  that  spherical  harmonics  are  finite 
sums  of  2-D  Fourier  harmonics,  which  is  not  the  same  as  being  each  a 
Fourier  harmonic.  From  this  simple  fact  follow  some  important  dis¬ 
tinctions. 


To  understand  them  better,  let  us  begin  by  stating  some  basic  pro¬ 
perties  of  Fourier  series  in  one  dimension,  which  carry  over  to  higher 
dimensions  but  are  easier  to  explain  in  one  dimension. 

~T 

If  sampled  at  a  constant  interval  AX  =  —  ,  the  following  is  always  true 
of  sines  and  of  cosines  :  * 


1Named  after  the  Nyquist  Theorem: 
of  period  2NAX  can  be  recovered  only  if 


the  Fourier  coefficients  of  a  function 
Na«x  ^  N. 
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if  m  ^  p  <  N 


3N-1 


cos  mkAX  cos  pkA  X  =  0 


a  N-i 

£  sin mkAX  sinpkAX  =  0 
v  =  o 
Sl*-1 

£  cos  mkAX  sinpkAX  =  0 

k  =  <5 

Ttese  expressions  are  discrete  counterparts  of 

r3TT  r** 

/  cosmX  cospXdX  =  /  stnmX  sinpXdX  =  0 
Jo  J  0  ...1 _ _ 


if  m  ^p  <  N 

for  all  m  and  all  p. 


(].  20-a) 


jTT 


fQ  cosmX  sinpXdX=0 


when  m  ,^p 
for  all  m  and  p. 


(1.20-b) 


and  show  that  the  orthogonal  properties  of  sines  and  cosines  are  maintained 
when  these  are  sampled  regularly,  provided  the  Nyquist  frequency  is  not 
exceeded.  From  this  follows  that 

a^  =  H  Y  (sin)  “k  AXf(kAX)  =  hf  (sin)  mXf(  >)  dX  (1.21) 

y =o  J  0 


where  H  - 


4,  if  m  =  0 

i  otherwise  ^  h  = 


in  if  m  =  0 

X  otherwise 
rr 


,  so  the  "Fourier” 


counterpart  of  (1.5),  (i. e.  (1.21))  is  an  exact  "numerical  quadratures" 
formula  for  Fourier  series.  When  the  Nyquist  frequency  is  exceeded,  the 
trigonometric  relationships 


cos  mkAX=  cos  (2N  1  m)  kA  X 
-  sinmkaX=sin(2N  *  m)kAX 


imply  that  (1.21)  will  give,  not  the  true  coefficients,  but  the  aliased  estimates 

K 


a°,  =  a° 


+  E  aahN+«i  +  E 

u  =  o  h=1 

fll«  =  a**  +  E  alahN+»  E 


a0 

a  3hN- a 


(1.2  2) 


h  =o 


h  sal 


2  hN-a 


with  K  such  that  2KN  is  below,  and  2(K+1)N  is  above  the  highest  frequency  present  in 
f(X).  Expression  (1.  20  a-b),  (1.  21),  and  (1.22)  are  the  foundations  of  discrete 
Fourier  analysis  (also  known  as  the  computation  of  the  discrete  Fourier 
transform,  or  D.  F.T.),  and  so  well  known  that  they  are  almost  second  nature 
to  many  engineers  and  scientists.  Unfortunately,  none  of  these  discrete 
formulas  has  exact  counterparts  in  spherical  harmonic  analysis,  and  this 
fact  has  been  the  cause  of  considerable  confusion.  The  most  common 
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misunderstandings  seem  to  occur  in  the  following  areas: 


(a)  Number  of  equations  and  number  of  unknowns : 

At  each  point  in  the  grid  it  is  possible  to  write  an  equation  such  as 

N  Bax  n  I  Ct  ft 

f(0iX1)=  [  [  [  c„  Yn,  (BuXi)  (1.23) 

n  =  0  »  =  0  a=0 

similar  to  (1.19).  The  maximum  number  of  coefficients  that  can  be  r cco- 
vered  vvitliout  exceeding  die  Nyquist  frequency  ( i.  e.,  those  with  n  <  N) 
is  N2.  Coefficients  of  degree  or  order  equal  or  higher  than  N  shall  not 
be,  in  general,  free  from  aliasing.  .Since  there  are  2N~  points  in  an 
equiangular  grid,  it  follows  that  the  maximum  cumber  of  fully  recoverable 
coefficients  is  also  half  the  number  of  points  (equations)  in  the  grid.  By  con¬ 
trast,  in  2-D Fourier  analysis  the  number  of  coefficients  equals  the  number 
of  points  in  the  grid  (in  the  interval  si  <  6  s  n,  0  <  X  ^  2ri).  One  could 
have  a  grid  with  only  N2  points,  as  proposed  by  Giacaglia  and  Lundquist 
(1972),  but  such  a  grid  would  not  be  equal  angular  on  the  sphere. 

The  author  had  discussed  this  problem  elsewhere  (Colombo, 

1979a,  paragraph  (5.2)  ),  showing  that  the  system  of  equations  (1.23) 
becomes  singular  when  all  coefficients  of  degree  and  order  0  £(n  ,  m)  s  m 
are  included  among  the  unknowns,  and  M  £  N.  In  other  words:  it  is  not 
possible  to  solve  for  a  complete  set  of  coefficients  to  degree  and  order  M  s  N. 
The  relevant  part  of  that  argument  can  be  summarized  as  follows:  the 
columns  of  A  f  the_giatrix  of  the  system  of  equations  (1. 23),  consist  in  succes¬ 
sions  of  values  YnB(0lf  X  , )  of  the  harmonics  corresponding  to  the  un¬ 
knowns  C®„  at  the  points  (0^  Xj)  in  the  grid.  The  scalar  product  of  two 
such  columns  is. 


N~1  2  N 

i  : 

-1 

L 

TA 

= 

N 

T  PnB(cos 

6i)  Pp,( 

1=0  J 

=  0 

l 

tzO 

3N-1 

1  cos 

|  mjAX  | 

;  cos 

|  1 

0  if 

a  i  P 

l  ' 

)  sin 

sin  i 

qj  A\  =  i 

0  If 

m  r  q 

S  =0 

1 

Vo  otherwise 

aocording  to  (1.20-a).  Therefore,  if  two  columns  correspond  to  unknowns 
of  different  orders  m  and  q,  they  must  be  orthogonal  and,  thus,  indepen¬ 
dent.  For  the  whole  matrix  not  to  be  singular,  all  oolumns  of  the  same  order 
m  must  form  sub-matrices  A(m)  that  have  full  rank.  Otherwise,  there 
will  be  columns  in  those  A(m),  and  consequently  in  A,  that  are  linearly 
dependent,  so  A  cannot  bo  inverted.  Consider  A(0),  corresponding  to  all 
unknowns  of  order  0.  This  is  a  2Nsx  (M  +  l)jmatrix,  and  the  elements  of 
the  columns  of  A(0)  have  the  same  values  as  Pn0  (cosO,) ,  0  s  n  ^  M, 

0  ^  i  £  N-l.  The  Pn„  are  functions  of  e1  only,  and  there  are  N  par¬ 
allels  in  the  grid,  so  there  are  nc.  more  than  N  independent  rows  in  A(0). 
Because  the  Pr„  (cos  0 )  are  polynomials  of  degree  n  in  oos  G,  these 
rows  form  a  sub-matrix  S(0)  of  A(0)  that  has  M  +  1  independent  columns, 


as  long  asM  +  l^N  or  M  <  N.  If  M  a  N  the  extra  columns  will  turn  S(0) 
into  a  rectangular  matrix  with  more  columns  than  rows;  in  other  words: 
there  will  be  no  square  submatrix  in  A(0)  of  rank  M  +  1  if  M  i  N,  so 
A(0)  shall  be  rank  defficient,  and  from  this  follows  that  A  must  be  singular. 

Emphasis  must  be  placed  on  the  word  c  omplete  when  referring  to  the 
set  of  "solvable"  coefficients:  it  is  possible,  by  removing  some  coefficients 
with  n  <  N  from  the  unknowns,  to  introduce  others  in  their  stead  with 
n  s  N,  but  then  the  solution  will  not  be  a  complete  set  of  coefficients. 

(b)  100%  aliasing: 

If  a  term  in  a  Fourier  series  has  a  frequency  n  >  N  ,  then  it 
will  be  aliased  with  lower  frequency  terms  and  become  impossible  to  dis¬ 
criminate  from  them.  For  most  functions  of  practical  interest,  the  higher 
the  frequency,  the  smaller  the  term,  so  the  coefficient  estimated  using  the  sum  in 
(1. 21)  will  be  dominated  by  the  lower  frequency  terms:  the  estimation  error 
is  thus  likely  to  exeeed  100%.  Estimates  above  the  Nyquist  frequency  are 
usually  regarded  as  meaningless  and  the  closer  a  term  is  to  that  fre¬ 
quency  with  increasing  n  ,  the  less  reliance  is  placed  on  its  estimate. 

In  the  case  of  spherical  harmonics,  expressions  (1. 16a-b)  show  clearly 
that  the  harmonic  yf,  consists  of  several  Fourier  terms  of  frequencies 
ranging  from  0  to  n  .  When  n  >  N  ,  only  that  part  of  the  Fourier  ex¬ 
pansion  of  above  the  NyquiSt  frequency  will  become  scrambled  beyond 
recovery;  part  of  the  harmonic  is  left  intact:  the  low  frequency  "tail", 
which  means  that  the  effect  of  aliasing  on  the  recovered  coefficients  does 
not  necessarily  reach  100%  (or  even  70%)  at  the  Nyquist  frequency,  as  shown 
in  the  examples  of  section  3. 

(c)  Orthogonality 

From  (1.20a)  follows  that  the  matrix  of  equations  (1. 19)  for  the 
Fourier  series  is  orthogonal,  so  the  coefficients  estimated  according  to 
( l,  21)  are  independent  from  each  other. 

In  the  case  of  the  YR^  orthogonality  does  not  carry  over  to  all  the 
sampled  harmonics,  unless  special  "quadratures'  weights"  are  introduced 
in  (1.5)  or  (1.6).  This  lack  of  orthogonality  affects,  for  instance,  the  for¬ 
mulas  for  mean  values  discussed  in  section  2.  The  method  of  Gaussian 
quadratures  is  an  example  of  "quadratures  with  weights"  that  gives  exact 
coefficients  when  the  Nyquist  frequency  is  not  exceeded,  though  it  requires 
a  special  grid  where  the  parallels  are  situated  at  the  same  latitudes  as  the 
zeroes  of  PMf1(cos9).  The  use  of  this  method  is  possible  because  the  product 
[Pn,(cos9)  Pp«  (cos  9)]  is  a  polynomial  in  cos  9  of  degree  n  +  p  £  2N  t 
the  grid,  however,  is  an  unusual  one.  Details  of  the  application  of  Gaussian 
quadratures  to  spherical  harmonic  analysis  are  given  in  a  report  by  Payne 
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(1971).  Other  examples  of  methods  that  recover  the  coefficients  exactly 
when  the  data  is  noisless  and  N,ax  <  N  arc:  least  squares  collocation, 
least  squares  adjustment,  and  the  algorithm  developed  by  Rice  and  Burrows 
from  expressions  (1.17)  and  (1.18  a-b). 

In  general,  not  all  the  discretised  harmonics  retain  the  orthogonality 
property,  and  the  estimated  coefficients  are  affected  by  the  values  of  many 
of  the  other  CD ,,  in  addition  to  those  whose  degrees  exceed  the  Nyquist 
frequency.  More  about  this  will  be  said  in  section  2,  when  discussing  least 
squares  collocation  and  adjustment. 

Summarising:  there  are  enough  differences  between  the  aliasing  of 
Fourier  series  and  that  of  spherical  harmonics,  in  spite  of  their  being  so 
closely  related,  to  require  a  great  deal  of  caution  before  using  the  intuition 
gained  from  one  type  of  analysis  when  attempting  the  other.  For  this  rea¬ 
son,  the  expression  "aliasing  error"  will  be  replaced  by  the  just  as  appro¬ 
priate  "sampling  error",  which  is  perhaps  less  charged  with  misleading 
connotations  because  it  has  not  been  applied  almost  exclusively  to  Fourier 
series. 

It  should  be  noticed  here  that  Gaposchkin  (1980)  has  published  formulas 
for  the  sampling  error  on  a  type  of  equal  area  grid  (i.e. ,  all  blocks  have  the 
same  area  as  the  equatorial  ones).  His  formulas  are  the  equivalent,  for  such 
a  grid,  of  expression  (1.22)  for  Fourier  analysis,  but  much  more  complicated; 
they  are  made  tractable  numerically  by  the  use  of  certain  recursive  expressions 
that  he  provides,  thus  showing  an  interesting  new  approach  to  the  study  of 
the  problem. 


1.4  Number  of  Operations  in  Analysis  and  in  Synthesis 

A 

Expressions  (1 .17)  and  (1.18  a-b)  can  be  used  to  calculate  the  C„# 
once  the  2-D  Fourier  coefficients  of  I(9lf  \}),  or  app,  ,  have  been  esti¬ 
mated  by  means  of  the  2-D  Discrete  Fourier  Transform  (DFT).  Using  the 
Fast  Fourier  Transform  (FFT^  algorithm  (Cooley  and  Tuckey,  1965),  the 
number  of  operations  required  is  proportional  to  the  number  of  data  points, 
which  is  the  order  of  Ns,  or  0(NS)  for  short.  The  FFT  is  discussed  further 
in  paragraph  1.9. 

Having  obtained  the  al*B  ,  calculation  of  (1. 17)  requires  N  oper¬ 
ations  per  C®.,  or  NxN3  =  N3  for  all  of  them.  Finding  the  coefficients 


^Most  of  these  calculations  have  die  form  of  scalar  products 

N-l  N_ 

p  =  £  ak  bk  t  so  the  basic  operation  of  finding  pk  =  akbk  +  p*-1  (p°  “0,  p  p) 

k  =0 

consists  of  one  sum  and  one  product. 
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1*^ p  by  means  of  the  recurslves  (1. 18  a-b)  adds  another  0(N3  )  operations 
that  can  be  obviated  by  computing  the  I*(P  once,  and  then  storing  them  on 
magnetic  tape  or  disk.  One  way  or  the  other,  cHN2)  +  0(N3)  operations 
are  needed  altogether,  or  cKN3)  when  N  is  large  (say,  N  s  180). 

As  already  mentioned,  this  prodedure  was  first  described  by  Ricardi 
and  Burrows  in  1972;  more  recently  (1977)  Goldstein  developed  a  very  sim¬ 
ilar  idea  and  formulated  a  similar  algorithm  for  synthesis.  Goldstein's 
method  uses  recursive  formulas  for  the  I*>p  different  from  (1.18  a-b).  The 
synthesis  algorithm  also  requires  0(N3)  operations. 

As  explained  in  paragraphs  1. 5  through  1.  7,  the  procedures  presented 
there  also  require  C^N3)  operations  for  analysis,  and  as  many  for  synthesis, 
though  they  are  formally  different  from  Ricardi  and  Burrows’.  In  paragraph 
1.8  it  will  be  shown  that  synthesis  requires  as  many  operations  as  analysis, 
because  one  is  the  dual  of  the  other. 

The  fact  that  two  rather  different  approaches  (Ricardi  and  Burrows' 
and  the  one  described  in  this  report)  require  essentially  the  same  number  of 
operations  suggests  that  't)(N3)"  might  be  a  property  of  all  analysis  and 
synthesis  algorithms  on  regular  spherical  grids  due,  somehow,  to  the  nature 
of  the  sphere  itself.  This  is  speculation,  of  course,  but  if  not,  are  there  other 
ways  of  partitioning  the  sphere  for  which  faster  methods  exist?  The  author 
has  discussed  this  possibility  before  (Colombo,  1979,  paragraph  4.6).  It 
is  interesting  to  notice  that  in  all  these  procedures  the  0(N3)  operations  are 
those  associated  with  01(  or  "column  operations";  "row  operations"  are 
only  0(1^).  In  the  case  of  the  Euclidean  plane,  the  2-D  Fourier  transform 
requires  the  same  number  of  operations  per  row  than  per  column,  0(N), 
thus  the  total  is  only  CXN3),  or  O(N)  times  faster  than  its  spherical  "counter¬ 
part." 

While  not  as  efficient  as  the  2-D  FFT,  the  algorithms  for  the  sphere 
considered  here  can  be  much  faster  than  the  straightforward  implementation 
of  expressions  (1. 5),  (1.6),  (1.2),  or  (1.8).  The  latter  has  been  the  ap¬ 
proach  of  many  scientists  who  have  developed  their  own  software,  but  whose 
main  interest  has  generally  been  far  removed  from  the  study  of  numerical 
techniques.  In  1976,  while  working  at  the  University  of  New  South  Wades 
(Australia),  the  author  developed  the  two  algorithms  of  paragraphs  1.5  and 
1.6,  and  C.  Rizos  programmed  them.  Subsequently  they  were  used  at 
Goddard  Space  Flight  Center,  in  Maryland.  To  everybody's  surprise,  Rizo's 
programs  turned  out  to  be  more  than  100  times  faster  than  those  in  use  at  the 
time,  when  run  under  the  same  conditions.  More  recently,  this  author  has 
written  the  subroutines  HARM  IN  and  SSYNTH  described  in  appendix  B. 

SSYNTH  has  been  used,  after  the  fashion  of  the  numerical  experiments  des¬ 
cribed  in  Section  2,  to  generate  64000  1°  x  1°  mean  values  (simulated  aver¬ 
aged  gravity  anomalies),  each  the  sum  of  the  90000  terms  of  an  expansion 


complete  to  degree  and  order  300.  This  took  less  than  50  central  processor 
unit  seconds  in  the  AMDHAL  470  V/6-n  owned  by  The  Ohio  State  University 
(OSU).  All  calculations  were  in  double  precision  (32  bits  words),  using 
FORTRAN  H  EXTENDED.  Precomputed  values  of  the  Legendre  functions 
(or  their  integrals)  were  read  from  tape,  and  all  operations  involving  trigo¬ 
nometric  functions  were  carried  out  by  a  Fast  Fourier  Transform  subroutine. 
These  two  characteristics,  plus  a  generally  tighter  coding,  are  the  reasons 
for  the  greater  speed  of  this  program,  compared  to  the  older  versions  men¬ 
tioned  before. 

In  all  these  methods  all  operations  along  a  given  row  or  parallel  (con¬ 
stant  Si)  are  independent  from  those  for  any  other  row,  so  a  parallel  processing 
computer  with  N  processors  (arithmetic  and  control  units)  could  analyse  or 
synthesize  a  full  grid  of  N  rows  as  fast  as  an  ordinary  computer  with  one 
central  processor  can  do  a  single  row.  This  N-fold  increase  in  speed  can  be 
obtained  with  the  same  type  of  basic  hardware  (gates,  registers)  that  is 
currently  used  in  conventional  "general  purpose"  main-frame  machines.  The 
full  power  of  the  algorithms  presented  in  this  work  will  be  realized  when 
computers  of  parallel  structure  become  more  widely  available  for  scientific 
applications  than  they  are  today. 


1.5  Algorithm  for  the  Analysis  of  Point  Values 
Expression  (1. 5)  written  in  full  becomes 

A  /y  1  <  COS  I 

C“»  *  A  £  £  p».  <cos0i>  (sin )  mj AX  f^i.Aj,)^ 

1  =  0  J  =  0 

which  corresponds  to  the  general  type 

Att  aN“1  aM-i  (cos) 

c„-  £  £  XV*  £  i  sinfmjAA  f^.A,)^,  (1.24) 

1=0  J  =  0  i~° 

where  x“*  could  be  ^  PBB  (cos0t)AtJ  as  above,  or  PM  (cos  0t)  u>j  in 
the  case  of  quadrature  w  ith  we  ights  to  t ,  etc . 

To  simplify  the  discussion,  the  grid  is  supposed  to  be  equal  angular 
and  NB»*=N-1,  This  and  the  algorithms  that  follow  can  be  easily  adapted 
for  the  equal  area  grids  current  today.  Subroutines  HARMIN  AND  SSYNTH 
(Appendix  B)  can  handle  the  cases  N,»X<N-1  and  N..x  >N-1  as  well  as 
N,„=  N-l  . 

The  equal  angular  grid  is  symmetrical  with  respect  to  the  Equator, 
and  assuming  that  ( the  same  as  P  B  „  (cos  0 1 )  or  Ptt  ,  (cos  0t )  s in  0 1) 

X*'  =  X«-i  If  a-m  is  even,  and  x\ *  -  -X^.i  if  n-m  is  odd,  one 
can  write 
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c“.  =  |ol(x*,-[T|^|mJ4Xf(8„XJ,]  + 

(■X)  xV  ^  21  |  sin  }  X{ 8n-i-i »  Xj )  \ 


(1.25) 


This  formula  suggests  the  following  procedure  in  two  nested  loops: 
START:  set  i  =  -1,  C^',  '1  -  0  for  all  0  £  n,  m  £  N  ; 

Outer  loop: 

(a)  increment  i  by  1  unless  i  =  £N  -  1,  in  which  case  STOP 


Inner  loop: 

(b)  compute  all 


(c)  find  all 


,ai 

'BB 


i)  _ 

“  BB 


+  K 


bl. 


+  (-D 


af*  I 

bV-  J 


Xi 


(1.26) 

\ 

for  0  s  n,  m  «  N  (where  "(-l)8-1  |  |  "  merely  indicates  that  is  to 

be  added  or  substracted  according  to  the  parity  of  (n-m)  );  GO  BACK  TO  (a). 

At  the  end  of  the  outer  loop,  =  C^,  . 


The  a1,  and  b1,  in  (1.26)  can  be  computed  by  taking  the  Fourier 
transform  along  row  i  and  row  N-l-i  of  the  values  of  f(  0,  X).  This  in¬ 
volves  O(N)  operations.  There  art  half  as  many  x“  *  as  C^v1*,  there¬ 
fore  it  takes  2(N  +  l)a  products,  and  just  as  many  sums,  to  form  the 

for  the  pair  of  rows  1  and  N-l-i.  Consequently,  there  are  0(N)  + 
0(N*)  operations  per  pair  of  row^  or  0(N®)  +  OfN3)  for  the  grid  as  a  whole. 
This  is  the  same  as  with  the  Rlcardl  and  Burrows'  algorithm,  quickly  ap¬ 
proaching  Q(N3)  as  N  increases. 


Subroutine  HARM  IN  (Appendix  B)  Implements  this  technique. 
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1.6  Algorithms  for  the  Synthesis  of  Point  Values 
Expression  (1.8)  is  of  the  general  form 

N-l  a 

f(9i.X3)  =  [  [  x"  [Cu  cosmXj  +  SB,  sinmXj]  (1-27) 

a  =  0  ®=0 

where  Xi  can  be,  for  instance,  KpBt(cos0t),  etc.,  with  K  being  a  proportionality 
constant.  Rearranging  terms  and  considering  the  parity  of  x* "  leads  to 


IWi.X,)  _  Y(r  f1  |xV 
!f(<Wi.  X,)  “  .toUnk  |(-Dtt-xV 


CM  cosmjAX  + 


n-i 


[i[  |<$**?|  M 8inmi4X) 

which  suggests  a  procedure  in  two  nested  loops: 

START:  set  i  =  -1 
Outer  loop: 

(a)  increment  l  by  1,  unless  l  =  £n  -  1,  in  which  case  STOP 

Inner  loop: 

(b)  compute  all 

a.  »  r  iV  '  £*• 

3:1  .t.  Xl  is.. 


(1.28) 


N-V-t 


i-i-i 


N-l 


=  Z  (-1) 


CB. 


for  0  s  m  ^  N; 
(c)  find  all 


^  (  ®t  »  Xj) 

»  X.  ) 


N-l 

=  *E| 

*  sso 


of1 


cos  mj  A  X  + 


I2L. 

Pi 


slum  jAX 


(1.29) 


for  0  s  ]  s  2N-1  (where  (-1)“"*  { —  }  means  the  same  as  in  the  previous 
paragraph);  GO  BACK  TO  (a)  . 


At  the  end  of  the  outer  loop  all  f  (  0, ,  X4)  in  the  grid  are  known. 


-17- 


Expression  (1.  29)  is  computed  by  applying  the  EFT  to  the  2N  ct\ ,  (per  row) 
and  the  2N  ~ ,  taking  O(N)  operations.  The  first  part  of  the  inner 

loop  (forming  the  ,  jS,1  and  a",'1'1,  P!T^)  involves  C^N3)  operations; 
for  all  £n  pairs  of  rows  the  total  is  0(N  )  +  0(N3),  and  this  tends  to  0(N‘ ) 
as  N  increases. 

Subroutine  SSYNTH  (Appendix  B)  implements  this  technique. 


1.7  Algorithms  for  the  Analysis  and  Synthesis  of  Area  Means 


(I)  Analys is 


Rearranging  (1.7) 


%*Y  [|££  CiM-D 


N-Ul  . 

a  a  ) 


B(m) 

A(m)] 


b»+  ('I)"" 


(1.  30) 


were  a.,  b,  are  the  same  as  in  paragraph  (1.5).  A  procedure  similar  to 
that  for  the  analysis  of  point  values  can  be  obtained  directly  by  replacing  the 
bracket  in  (1.26)  with  that  in  (1. 30),  and  then  proceeding  as  in  the  algorithm 
for  point  values.  The  total  number  of  operations  is,  once  more,  C^N2)  +  0(N3), 
or  Q(N3)  for  large  N.  Subroutine  HARMIN  also  implements  this  algorithm. 


(II)  Synthesis 


0  i+A6 


Truncating  the  series  in  equation  (1.2),  replacing  J  &  PBp  (cos  0)  sin0d0 


with  Xi",  rearranging  terms, considering  the  parity  of  and  using  a,  ,  )9 

as  defined  in  paragraph  (1.6),  leads  to  the  expression 


I u 

[|&- 


-  t 

- 

1 

aa 

j 

5, 

1 

N1-1 
Oi  p 

A(m)  -  B  ~ 

B(m) 

J 

■  =  0 

L 

B(m) 


+ 

$  • 


(1.31) 


A(m)  sinmjAX 


Thealgorilhm  for  the  synthesis  of  the  Fs  j  is  a  direct  extension  of  that  for 
point  values.  The  number  of  operations,  once  more,  is  0(N2)  +  0(K3 )  for 
large  N  .  Subroutine  SSYNTH  implements  this  algorithm  as  well. 


1.  8  Duality  between  Analysis  and  Synthesis 

Pairs  of  direct  and  inverse  linear  transforms,  such  as  Fourier  trans¬ 
forms,  possess  dual  characteristics:  certain  words  and  mathematical 
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expressions  can  be  arranged  in  pairs  (a,b)  such  that,  if  every  "a"  is  replaced 
by  its  "b"  in  any  statement  or  equation  valid  for  a  function  f  ,  then  the  mod¬ 
ified  statement  is  valid  for  the  transform  F  c£  f  ,  and  viceversa. 


Analysis  and  synthesis  of  spherical  harmonics  are  reciprocal  linear 
transformations  of  data  into  coefficients  and  of  coefficients  into  "data"  closely 
akin  to  Fourier  transforms,  so  they  can  be  expected  to  exhibit  some  dual  prop¬ 
erties.  Comparing  the  formulas  and  algorithms  in  paragraphs  (1.5),  (1.6) 
and  (1. 7)  shows  many  similarities,  among  them  the  number  of  operations. 

This  can  be  understood  as  being  a  consequency  of  duality.  To  make  this  point 
clear,  consider  the  following  pairs: 


( 

f(%  .x3) 

A 

f(  ®i»  ^  j)  1  )(and  similarly  with 

ftJ 

V 

f(®M_l_t  X  j) 

’ 

S..I /•[ 

Sn. 

’ 

f(  t  Xj)j/ 

Th-  1—1 

/»«  /. 
\X  i  r 


l)n_*  ,  sinmjAX)  ;  (i,  m) 


(3. a);  (X?, 


cos  mjAX) 


From  these  we  can  derive  the  following  pairs: 


a* 


a' 


N-l-i 


V-l-i 


ibi  I 

bfH  • 


Pm 


A(m) 

ffi(m) 


a  ■ 

,  aN-M 


B(m) 

A(m) 


(A 
■  pT 


(a,  ,  A(m)) ;  (b1,  ,  B(m))  ;  (aTx_1  ,-B(m))  » (b*1  1  ,  A(m));  etc., 
and,  in  conclusion: 

("ANALYSIS",  "SYNTHESIS") 


Each  one  of  the  analysis  algorithms  becomes  its  synthesis  counter¬ 
part  by  a  simple  replacement  of  terms.  Once  an  algorithm  for  analysis 
(synthesis)  is  defined,  the  corresponding Ugorlthm  for  synthesis  (analysis) 
follows.  For  instance,  one  can  easily  apply  the  principle  of  duality  to  the 
Ricardi  and  Burrows'  method  of  paragraph  (1.  4)  to  obtain  a  synthesis  technique. 


1.9  Usefulness  of  the  Fast  Fourier  Transform  Method 


The  excellent  book  by  Brigham  (1974)  gives  a  thorough  presentation 
of  the  l-D  discrete  Fourier  transform  and  its  applications,  and  explains 
in  detail  the  method  known  as  FFT  for  computing  such  transform.  The 
Fourier  transform  in  2  and  higher  dimensions  chn  be  found  simply  as  follows: 


first,  get  the  1-D  transform  of  each  row,  then  that  of  each  column  of  the  mod¬ 
ified  array  ....  etc. ,  until  all  dimensions  have  been  exhausted  in  this  way. 
Understanding  the  workings  of  the  l-D  transform  is  enough  to  understand 
those  of  the  N-dimensional  transform  as  well.  The  FFT  requires  0(number  of 
points)  operations  for  each  row  along  the  nth  dimension,  so  the  number  for 
all  points  in  a  regular,  euclidean  array  is  always  of  the  order  of  the  total 
number  of  points  in  that  array. 

Before  the  mid-sixties  _when  the  FFT  came  along  _  the  best  techniques 
available  for  the  analysis  of  data  on  regular  arrays  required  0(number  of 
points)2  operations.  The  increase  in  speed  of  o(number  of  points)  brought  about 
a  true  revolution  in  data  processing:  work  that  had  been  long  regarded  as 
impossible  became  feasible  overnight,  the  field  of  industrial  and  scientific 
applications  for  numerical  Fourier  transforms  expanded  tremendously; 
the  impact  in  areas  as  diverse  as  cristalography  and  communications  engi¬ 
neering  was  remarkable. 

Having  mentioned  the  positive  side  of  the  FFT,  which  is  used  in  the  al¬ 
gorithms  described  so  far  ( at  least  in  principle)  and  in  the  programs  HARMIN 
and  SSYNTH,  it  is  only  proper  to  say  something  about  its  alternatives. 

The  FFT  calculates  aU 2N  Fourier  coefficients  a,,  b,  very  efficiently, 
but  takes  just  about  as  many  operations  to  get  only  a  few  coefficients  as  it 
takes  to  get  all:  forN,,*  small  compared  to  N  there  may  be  a  real  dis¬ 
advantage  in  using  the  FFT.  The  FFT  is  most  efficient  when  the  grid  is 
such  that  N  is  an  integer  power  of  2.  The  grids  used  in  geodesy  are  usually  based 
on  the  division  of  the  circle  In  360°,  and  many  on  the  sexagesimal  division 
of  1°  as  well,  hi  all  of  these  N  contains  factors  other  than  2,  so  a  less  ef¬ 
ficient  version  of  the  FFT,  known  as  the  mix-radix  FFT  (Singleton's  algorithm) 
must  be  used. 

Finally,  the  mix-radix  algorithm  is  rather  convoluted,  so  it  is  best 
to  take  ready  available  subroutines  from  software  libraries  (as  it  is  done  in 
HARMIN  and  SSYNTH)  rather  than  to  incorporate  the  FFT  "on  line"  in  the 
program  one  is  writing.  This  means  that  the  program  is  going  to  be  less 
self-contained. 

The  "pre-FFT"  methods  can  be  more  efficient  than  the  FFT  when 
N,,««  N ;  they  are  also  very  easy  to  program.  For  the  sake  of  completeness, 
the  outline  of  a  method  this  author  has  used  quite  often  will  be  given  here. 

Consider  the  trigonometric  relationships 
cos(aj9)  f  2cos,8  cos(a- 1)  0-cos(ot-2)0  (1.32) 

sin(a0)  =  2cos  0  sln(a-  1)  8-  sin(a  -  2)  0  (1.33) 
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with  a,  j8  real.  If  all  the  values  of  the  trigonometric  functions  could  be 
obtained  with  one  operation  oaclu  the  number  of  operations  involved  in 


find  ing 


£5  N-l 

'So 


cos 

sin 


mj  AX  ( X  s)  f(  0j ,  X j)  woulc3  bo  2(2N)»  or  4N2  fox 


all  0  <  m  s  n.  In  other  words:  0(NS),  as  expected.  This  is  precisely  what 
can  be  done  using  ( 1. 32), (1. 33)  as  recursive  expressions  for  cosm(jAX)  and 
sinm(jA  X)  with  m  integer  and  0  £  j  £  2N  -  1.  The  values  of 
cosm(-AX)  =  cos  mi  X  and  sinm(-AX)  =  -sin  mi  X  ,  are  needed  to  start  the 
recursion;  they  can  be  calculated  with  standard  trigonometric  subroutines. 

The  use  of  such  subroutines  Increases  the  number  of  operations  slightly  over 
4N2,  but  if  N  is  large  enough,  this  is  negligible. 


With  all  calculations  carried  out  in  double  precision,  this  method 
gives  values  of  cosine  and  sine  that  coincide  to  better  than  5  significant 
figures  with  those  provided  by  the  standard  FOItTRAN  functions ,  when 
N  is  as  large  as  1800  (0. 1°  x  0. 1°  grid).  By  taking  advantage  of  half-wave 
symmetries  in  the  sine  and  cosine,  and  by  ingenuous  programming,  the  num¬ 
ber  of  operations  can  be  reduced  by  a  factor  of  4  or  more  rather  easily. 


1. 10  Functions  Harmonic  in  Space  and  their  Gradients 


If  f(0,  X,  r)  satisfies  Laplace's  equation  v2f  -  0  in  the  space  outside 
a  sphere  of  radius  a  ,  then  it  can  be  represented,  in  that  space,  by  the 
solid  spherical  harmonic  expansion 


«(0.X,r)  -lit  C“B  Y®  (4 

n ~ o  a  —  o  0  1 


X) 


(1.34) 


If  we  consider,  at  a  point  P  ^  (0,  X,  r)  in  space,  the  local  triad  r,  h,  t 
oriented  downwards  to  the  origin,  West  to  East  along  the  tangent  to  the  local 
parallel,  and  North-South  along  the  local  meridian,  the  components  of  the 
gradient  of  f(0,  X,r)  along  this  three  axes  are 


M  (9 
3r 


3h 


n  =  0  B  =  0 

co  n 


ti=  o  »  =  0 

m  Cna  sin  m  X] 

go  n 

|f(0,X,r)=  l  t 

at  n  =  0  "=0 


ll 

o 

an  ,  4  T-a  -ot 

pr+a  (n  +1)  C„ „  Yno 

(0,  A) 

(1.35) 

a" 

fwC+2 

PnB  (cos  9)  cosec  (8) 

lmSnD  cos mX  - 

(1.36) 

an 

]T»+S 

dPn»  (COS0)  lCns  COSmX+Sno 

d  0 

sinmX] 

(1.37) 

1 


1Hereone"operatiorf,'  as  mentioned  in  paragraph  (1.4),  consists  of  one 
sum  and  one  product. 
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Expressions  (1.34),  (1.35),  (1.36),  and  (1.37)  appear  often  in  the  discussion 
of  geodetic  problems,  and  their  calculation  from  a  set  of  coefficients  is  an 
important  problem.  If  all  the  values  of  any  of  them(\vith  r  constant ^nd 
on  a  regular  grid)  arc  required,  the  methods  discussed  so  far  can  be  used 
after  a  few  minor  additions. 


The  synthesis  algorithms  can  be  thought  of,  for  the  purpose  of  this 
discussion,  as  "black  boxes"  with  the  coefficients  CbB  >  the  Legendre  func¬ 
tions,  N,ax,and  N  as  inputs,  and  all  the  2N2  values  of  f(8,A)  on  the 
corresponding  regular  grid  as  outputs.  To  compute  the  expressions  given 
above,  only  the  part  of  the  input  consisting  in  the  coefficients  and/or  the 
Legendre  functions  has  to  be  modified  before  they  enter  the  "box",  which 
r^pains  untouched,  ^or  instance:  to  compute  (1.  34)  one  should  replace 
•C, with  "  pj+2  C0  *  "  in  the  "input";  the  others  are  equally  obvious  and 
will  not  be  explained  further.  The  following  recursive  formulas  can  be  used 
to  obtain  the  derivatives  of  the  Legendre  functions: 

dP„  =  (sin0)-1  ]n  Pn„(cos0)  cos 0  -  [~(n  ^  Pn-i  m(cos  0)f 

d~0  L  I2*'1)  J  (i.38a) 

lsin0  ir5”-1  --1  +  cos0  P*-J  -1!  (1*38b) 

with  the  starting  value 

Si®"  - ° 

These  recursives  follow  from  the  unnormalized  formula 


(cos20-  1)  d^PB  B  os  0)  =  n  cos  0  PBB(cos0)  -  (n  +  m)  P^i.  (cos  0) 

(N.N.  Lebedev, 'Special  Functions",  Dover,  1972,  Ch.  7,  equation  7. 12. 16),  and  from 

i 

p  =  ri^LlLl2  sin©  Pn-i  n_i  (see  paragraph  (4.4)) 

L  (2n  -  1)  J 


and 


P„  =  r  -  - ,  ]*  p..  i  -  psm  p. 

L  2{2n  f-  1)  (n  -  m)  !  J 


2(2n  f-  1)  (n  -  m) 

The  complete  recui'sive  expressions  for  the  Pn  B  are  given  in  paragraph  (4.4). 


The  expansion  for  the  area  means  defined  by  (1.2)  can  be  differentiated 
terra  by  term  because  it  converges  uniformly.  The  expressions  for  area 
means  gradients,  equivalent  to  those  given  here  for  point  values^  arc  immed¬ 
iate.  They  can  be  computed  after  simple  modifications  to  the  CBB  and/or 
the  xV»  and  using  the  same  programs  for  computing  the  area  means. 

Subroutine  "LEGFDN",  listed  on  Appendix  B,  can  compute  both  the  nor¬ 
malized  Pn„(cos  0)  and  their  derivatives  dPB„(cos0). 
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2.  Error  Measure  and  Optimal  Quadrature  Formulas 

This  section  introduces  a  criterion  for  quantifying  the  errors  of  num¬ 
erical  quadratures  formulas  that  is  based  on  the  statistical  properties 
of  the  data.  Three  qualities  are  highly  desirable  in  an  error  measure:  (a) 
it  should  be  easy  to  determine;  (b)  it  should  be  mathematically  tractable; 

(c)  it  should  provide  a  good  idea  of  the  likely  size  of  the  actual  errors.  Point 
(a)  is  taken  into  account  by  choosing  a  quadratic  measure,  because  the  numerical 
formulas  are  linear  estimators  of  the  C®, ,  and  the  linear,  quadratic  esti¬ 
mation  problem  is  fairly  simple,  with  its  mathematical  side  very  well  under¬ 
stood  and  developed  today,  which  takes  care  also  of  (b).  Regarding  (c),  the 
reader  will  have  to  wait  till  section  three,  where  certain  evidence,  obtained 
from  numerical  experiments,  supports  the  assertion  that,  though  statistical 
in  nature,  the  error  measure  adopted  represents  the  actual  errors  very 
closely. 

Having  defined  the  error  measure,  the  notion  of  optimal  or  best  formula 
according  to  such  measure  is  investigated,  leading  to  the  application  of  least 
squares  collocation  and  least  squares  adjustment  to  spherical  harmonic  analysis. 


2.1  The  Isotropic  Covariance 

The  Isotropic  covariance  (expression  (1. 10))  between  two  functions  u(6,Al, 
v(9  ,A)  on  the  unit  sphere,  both  expandlble  In  spherical  harmonic  series 


00  a  1 

u(0.  A)  =  Ku  £  Z  Z 

aft 

^(d,X)  ; 

n  =  0  »=o  0P=0 

V  (  6»  A)  =  KrZ  Z  Z 

cv“ 

^  n* 

a  =  0  ,=  o  0=0 

can  be  formally  defined  as  follows 

cov(u(P),  v(Q))  =  M|u(P)  v(Q)j  (2.1) 

where  M  {  }  is  the  isotropic  averaging  operator  and  P  and  Q  are  two  points 
on  the  sphere  separated  by  the  spherical  distance  0t9  .  The  operator  m{  } 
symbolizes  the  average  of  its  argument  (in  the  present  case  the  product 
u(P)  v(Q) )  over  all  rotations  of  the  sphere.  This  can  be  visualized  if  one 
thinks  of  the  points  P  and  Q  as  given  in  a  fixed  system  of  coordin.tes,  while 
the  sphere,  on  which  u  and  v  are  defined, rotates  in  all  possible  ways.  After  all  the 
(infinitely  many)  possible  rotations,  the  average  product  u(p)  v(Q)  will  be  iden¬ 
tical  to  cov(u(P),  v(Q)).  This  kind  of  covariance,  though  purely  geometrical, 
resembles  closely  that  of  stochastic  processes  such  as  time  series. 


The  importance  of  the  isotropic  operator  and  the  isotropic  covariance 
function  in  spherical  harmonic  analysis  stems  from  the  fact  that  the  latter  can 
be  described  as  the  estimation  of  certain  parameters  of  a  function  f  ( S,  X), 
the  Cj,  ,  from  data  sampled  on  a  sphere.  From  paragraph  (2.7)  on,  this 
report  deals  with  optimal  estimators  for  the  C  ^ ,  based  on  the  theory  of  least 
squares  collocation.  Such  optimal  estimators  minimize  a  quadratic  measure  of 
the  error  that  is  defined  in  terms  of  the  operator  M  {  [  ,  this  measure 
being  introduced  in  paragraph  (2. 4). 

The  idea  of  least  squares  collocation  is  related  to  the  basic  principles 
of  such  linear,  minimum  variance  estimators  for  time  series  as  the  Wiener 
and  Kalman  filters,  which  have  found  wide  application  in  the  physical  sciences 
and  in  engineering  over  the  last  thirty  years,  and  have  been  generalized 
to  deal  with  both  continuous  and  discrete  time  processes,  and  also  "processes" 
in  more  than  one  dimension,  such  as  are  found  in  pattern  recognition  and  in 
digital  image  enhancement.  Two-dimensional  Wiener  filtering,  of  which  the 
reader  can  find  several  fine  descriptions  in  the  special  issue  of  the  Pro¬ 
ceedings  of  the  IEEE,  Vol  65,  No.  6,  1977,  is  also  applicable  to  "flat-Earth" 
geodetic  calculations;  least  squares  collocation  can  be  regarded  as  the  ex¬ 
tension  of  this  type  of  filtering  to  calculations  on  the  sphere.  Isotropic  av¬ 
erage  operators  are  not  the  only  ones  that  could  be  used  in  the  "statistical" 
approach,  though  they  are  probably  the  easiest  to  work  with  and,  perhaps,  the 
best  for  the  sort  of  application  considered  here.  For  a  description  of  other 
likely  operators,  the  reader  is  referred  to  the  paper  by  Rummel  and  Schwarz 
(1978).  Probably  the  most  didatic  introduction  to  the  method  of  collocation 
remains  Heiskanen  and  Moritz,  (Ch.  7,  1967). 

Reasoning  as  in  Heiskanen  and  Moritz  (ibid),  one  can  show  that 

CO 

COV(U(P),  v(Q)  )  =  £  C“n,V  Pn(COS^pQ) 

a  =  0 

which  is,  in  fact,  expression  (1. 10*)  _  the  definition  of  the  isotropic  covariance 
given  in  section  1  without  any  reference  to  M  {  }  .  Similarly, 

CO 

COV(U(P),  U(Q))=  £  <*»  P„(COS0pQ) 

a=0 

usually  known  as  "the  covariance  of  u  "  (expression  1. 10),  while  (1.10*) 
represents  the  "covariance  between  u  and  v  ",  or  "the  crosscovariance 
of  u  and  v  ".  The  one-to-one  relationship  between  covariance  and  power 
spectrum  (or  crosscovariance  and  crosspectrum)  should  be  clear 
from  these  expressions. 

— CK 

To  apply  the  notions  introduced  above  to  the  Ct„  it  is  necessary  to 
think  of  them  as  functions  rather  than  fixed  values.  This  is  possible  if  one 
considers  changes  in  the  coordinates  8,  X  brought  about  by  rotation.  Each 
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such  change  results  In  different  coefficients,  though  the  function  they  describe 
is  the  same,  only  rotated  with  respect  to  the  old  system.  The  new  system 
can  be  related  to  the  first  by  three  angles:  the  coordinates  9,  X  of  the  shifted 
north  pole,  and  the  azimuth  A  of  the  zero  meridian.  Therefore,  the  c'jf, 
are  functions  of  9,  X,  A  and  this  is  enough  to  define  the  average  over  all 
rotations  of  the  product  of  each  C®,  by  itself  or  by  another  function,  in  a 
meaningful  way.  Two  important  properties  of  spherical  harmonics  are: 


(A) 


I  A  —  a  —  9 

M  £  c„  +  s„ 

(  *  =0 


M 


<7* 


(2.2) 


i.e. ,  the  power  spectrum  is  invariant  with  respect  to  rotations.  This 
follows  from  the  plain  fact  that  the  integral  f  fs  ( 9,  X)  da  is  invariant 

•'a 

over  rotations,  and  from  Parseval’s  identity  (1.12);  (2.2)  implies  that  the 
isotropic  covariance  function  (1. 10)  is  likewise  invariant. 


(B) 


M 


_p 

Y».<>)  Y„(9) 


=  0  if 


a  *  9 
n  i  p 
m  4  q 


for  all  P  and  Q 


(2.3) 


i.e. ,  the  orthogonality  properties  of  spherical  harmonics  with  respect  to 
integration  on  the  sphere  are  also  true  with  respect  to  averaging  over  ro¬ 
tations. 

As  a  consequence  of  (2. 2)  and  (2. 3)  above,  the  following  relationships 


are  also  true: 

M|c„,  Mis..  *1= 

(2. 4-a) 

/y  A 

M  C!,}  -  0  if  j 

n  4  p 

(2.4-b) 

1 

m  i  q 

M{Cf,  }  «  0  if  n  4  0 

(2.5) 

Mic“.  f(9,X)|  *  **  (0'X) 

(2.6) 

M  I  C  a  ■  1  li}  (in  +  l)h 

77"  A,  Y»*  ( 9,  X) da 

(2.7) 

-25- 


For  the  derivation  of  (2.  G)  the  reader  can  see  (Rummel,  197G),  and  (Sjoberg, 
1978)  for  (2. 7).  As  for  (2. 4a -b)  and  (2. 5),  the  proof  is  given  now. 


According  to  (1.4): 

16 m{(c“,)8]-=  M  {/y£((XX)  f(Q;X’)  do£  (91 X')  f<e;X‘)  da'}  = 

xMU  X'  (e.X)f(3,X).  f  ( 6',  X' )  Y^b  ( 0 X')  da  da'  = 
fa  fa,  VnaB(0,X)  y“  <0\X'>  M{f(U)f(S',X')}  dada'  = 

fa  fa  Y®  (6.  X)  y“  (0’,X')  cov  (f(P),  f(Q))dada’ 
where  P=(0,  X)  and  Q  =  (0',X').  According  to  (1. 10): 

GO 

167T2  m{(C^b)2}=  fa  ^(0,  X)da  /a.  y“  (0',X’)  IQ  a2  Pn(cos&,)d a’  = 

a  '  2  ^  2 

=4tt  Jo  Y»*  (6,  X)  Y®  (0  ,  x  )  da  =  -5— J  16tt3  because  of  (  L.  14) 

Similarly, 

m{c".  Cp,}=  2iT+T  *  "4^  fv  (0.X)  YpPq  (0,X)da  *  0  if  n/p 

m/q 

Finally,  recalling  that  Y°00(Q,X)  =  P00(cos  0)  =  1  for  all  -tt  £  0  £  n  and  all  X, 

m{c“}=  p*  Y«(0,X)  M{f(0,X)}  da=M{f(0,X)}  j^YnaB(0,X)da 

=  M.{f(6,X)}^  y,“(0,X)  y£0  (0,  X)  da  =0  if  n  /  0  , 
which  completes  the  proof. 

2. 2  Some  Additional  Notation 

So  far,  data  points  on  the  sphere  have  been  identified  by  the  subscripts 
i  and  j  .  Alternatively,  they  could  bo  arranged  according  to  a  siigle  subscript 
k  =  2Ni  +  j  (where  N  is  the  number  of  parallels  in  a  grid  and  2N  the  num¬ 
ber  of  meridians),  so  the  points  in  the  "0"  row,  ordered  by  increasing  j  ,  are 
followed  by  those  in  the  "l"  row,  in  the  same  order,  etc. ,  the  last  element  in 
the  "N  -  1"  row  closing  the  sequence.  Based  on  this  convention,  the  set  of  all 
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values  of  f  (0 1  ,Xj)  (or  f  it)  can  be  arranged  in  Np  -  vector  form  according  to  k 


lzo  Zl'  •  •  z  i 


where 


z  k  ~  f  (9  i » Xj)  or  zk  -  T  tj 


with  k  =  2Ni  +  j 


and  where  Np  is  the  number  of  data  points,  or  2N3  for  equal  angular  grids. 
In  a  similar  way,  the  coefficients  C  „B  can  be  ordered  according  to  a  single 
subscript  p  =  n2  +  an  +  m  +  1  (with  the  understanding  that  the  meaningless 
3n0  are  not  included  )  defining  the  following  Nc  -  vector: 


t  C  0  C1  •  •  •  Cp.  .  .  CN(1  ]T 


~  C. 


p  =  n2  +  an  +  m  +  l 


where  N0  =  (Nb0X  +  l)2  .  Using  this  notation,  expression  (1.  27)  for  point 
data  quadratures  can  be  written 


=  Inn  z 


(2.10) 


where  the  estimator  vector  faB  ,  of  dimension  Np  ,  has  elements  of  the  form 


vn«  {cos) 
Lain/ 


mj  AX 


under  the  convention  given  above  relating  k  >  I  and  j. . 

a  a  a 

Grouping  all  the  estimates  CBB  in  a  vector  c  ordered  in  the  same  way 
as  c  ,  the  relationship  between  the  Cn®  and  £  can  be  written,  in  matrix 
form, 


c  =  F  z 


(2.11) 


where  F  is  the  estimator  matrix  implied  by(2. 10).  It  is  a  NpxN0  matrix 
(where  Np  is  the  number  of  data  points  in  the  grid),  each  row  be in^ formed  by 
the  coefficients  of  the  quadrature  formula  for  the  corresponding  Can.  Such 
row  is  also  the  transpose  of  the  estimator  vector  of  this  ,  designated  _f^B 

in  (2.10). 


In  the  same  way  as  the  covariance  function  between  scalars,  the  cov- 
iance  between  vector  functions  can  be  defined  in  terms  of  m|  j- : 


r  n 

M  as  z  j  =  Clt 


(2.12) 


where  CtI  is  the  covariance  matrix  of  of  dimension  Np  x  Np  .  This 
matrix  if  a  function  of  the  relative  positions  of  the  points  in  the  grid  on  which 
?_  has  been  determined,  in  ihe  same  way  as  the  scalar  covariance  depends  only 
on  the  distance  between  two  points.  The  elements  of  CIX  arc 


c”  =  M  ;zf  z.}  =  M;f(Pr)  f(P.)}, 

i.e. ,  the  values  of  the  scalar  covariance  corresponding  to  pairs  ef  points 
in  the  grid. 


In  the  same  way 

r 

M\cc  J  =  C  (2.13) 


is  a  N0  x  Na  diagonal  matrix  according  to  (2. 4a-b) .  C  is  the  covariance 
matrix  of  the  coeffic  ients.  Similarly,  the  covariance  between  c  and  z  is 


M 


»  C 


o  z 


(2. 14) 


where  C„  Isa  N,xN,  matrix,  the  elements  of  which  are 

c  a*  =  M  i_c  5  z,}  =  M^Ctt>f(©i,X1)j' 

where  the  right  hand  side  is  given  by  (2.6). 

Finally,  when  estimating  the  C  not  from  samples  of  f  (9,X),  but 
from  measurements  corrupted  by  noise 


m(fli.Xj)  -  f(«i,Xj)  +  ntl  (2.15) 

the  measurement  errors  can  be  grouped  in  a  Np  -  vector  n  with  the  same 
ordering  as  z_  ,  and  the  sum  of  both  will  be,  then,  the  Np  -  vector  of  ob- 
served  values 


m  =  z  +  n 


(2.16) 


The  measurement  errors  are  values  that  occur  in  time,  as  successive 
observations  are  carried  out:  they  constitute  a  time  series.  The  average 
operator  appropriate  to  them  is  the  usual  statistical  expectation  operator 


l  j 


The  measurements  are  suppossed  to  be  unbiased,  so  E 


opera toi 
■i  nk  ►  = 


for  all  k  .  The  covariance  implied  by  this  operator,  is  theusual  statistical 
covariance:  E-|nk  j  «  cfc3*  ,  and  E  |nknrj-  =  c^,r  •  This  can  be 


generalized  for  the  noise  vector  a  : 
E{nnT}  =  D 

where  D  is  a  Np  x  Np  matrix  of  elements 
d*r  =  E{nkn,}  =  a\T 


(2. 17) 


(2. 18) 


Both  C  ,  t  and  D  have  In  common  a  very  important  properly: 
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xTC,,x  ^  01  If  xT  ^  0  ,  x  any  Np  vector, 
xT  D  x  2  0  J 

i.e. ,  they  are  always  positive  matrices,  moreover.  In  all  the  cases  considered 
here,  at  least  D  is  positive  definite? 

x T  D  x  >  0  for  all  x 

Positiveness  can  be  Inferred  readily  from  the  definitions  of  Ctl  and  D  : 
i.e.,  xTD  x  =  x  E  UnT}x  =  E{xTnnTx}  =  E  {h2}  *0 

(where  h  =  xT  n) ,  and  similarly  for  xT  C  tt  x^with  j  j  . 

2.3  Estimation  Errors,  Sampling  Errors,  and  Propagated  Noise 
A  linear  estimator  is  of  the  general  form 
£  =  F  m 

where  m  is  the  vector  of  measurements  defined  by  (2. 16),  and  £  is  the  vector 
of  estimates,  made  up  in  our  case  of  the  <3a°^ .  According  to  (2.11)  and  (2. 16) 

£  -  £  ~  F  (Js  +  n) 

In  general,  the  estimates  will  not  be  exactly  equal  to  that  which  is  estimated, 
the  difference  being  the  estimation  error.  In  matrix  notation 

e  =  c  -  c  =  (c  -  F  z )  -  (F  n)  (2. 19) 

e  being  the  estimation  error  vector.  The  two  terms  in  the  expression  above 
can  be  defined  as  the  components  of  this  error: 

e ,  =  c  -  F  z 

which  is  the  estimation  error  in  the  case  of  noisless  (perfect)  data;  and 

2.V  =  F£ 

which  is  the  error  due  to  the  noise,  or  propagated  noise. 

The  error  eIp  may  be  due  to  a  number  of  reasons.1  If  it  is  zero  for 


1Using  the  relationship  p  =  na  +  an  +  m  +  l  of  paragraph  (2.2),  e,p  stands 
for  the  sampling  error  in  d&9 ,  and  e^p  for  the  propagated  noise. 


some  estimator,  then  its  presence  in  other  estimators  could  be  blamed  on 
them  being  somewhat  inadequate.  For  instance,  if  the  estimator  was  chosen 
by  taking  the  elements  of  F  from  a  set  of  random  numbers,  then  the  estima¬ 
tion  error  is  likely  to  be  always  high,  as  the  estimator  has  nothing  to  do  with 
the  actual  problem.  In  particular,  the  addition  of  extra  measurements  to 
the  vector  m  is  not  going  to  bring  any  general  improvement  on  the  estimates. 

On  the  other  hand,  if  attention  is  paid  to  the  nature  of  the  problem  when  selecting 
F,  one  would  expect  the  error  to  decrease  as  more  data  is  introduced.  If, 
as  the  number  N*  of  samples  in  m  tends  to  infinity,  e*p  tends  to  zero, 
one  could  say  that  the  error- is  due  to  the  incomplete  sampling  of  the  signal 
f  (  9,  X) ,  and  call  it  the  sampling  error.  This  is  precisely  the  case  with 
any  of  the  quadrature  formulas  to  be  studied  here,  all  of  which  can  be  written 
formally  as  linear  estimators  according  to  (2. 11),  and  for  all  of  which  the 
error  eip  vanishes  as  the  number  of  samples  tends  to  infinity,  because  the 
sums  become  identical  with  the  Integrals  defined  by  (1.4).  In  this  sense  it  is 
quite  suitable  to  call  e,p  the  sampling  error,  as  in  paragraph  (1.3). 


2.4  The  Quadratic  Error  Measure 

The  overall  error  measure  will  be  defined  here  as  the  sum  of  two  quad¬ 
ratic  terms:  onefor  the  propagated  noise,  the  other  for  the  sampling  error. 


(a)  Propagated  Noise  Measure 


This  measure  is  the  same  as  in  least  squares  adjustment,  i.e. ,  the 
variance  of  the  error  defined  in  terms  of  the  usual  statistical  expectation 
operator 

=  E{(ff/n)8}  (2.20) 

according  to  (2. 10).  This  variance  represents  the  scatter  in  the  value  of 
<5®  due  to  the  uncertainty  in  the  values  of  the  data.  In  matrix  form 

Etj  =  E  {e^}  =  E  {FnnT  FT  }  »  F  E{nnT}  FT  =  FD  FT(2.21) 

where  Efj  is  a  N„  x  N,  matrix,  while  D  was  already  presented  in  paragraph 

(2.2). 


In  the  special  case  where  the  measurement  errors  are  uncorrelated, 
D  is  diagonal,  and  (2.20)  becomes 


a  „  f,  CtT  T  ,a  1  ,a  T  a 

°  rf  ~  E  iln*  °  2.  in*  J'  =  in*  Din*  = 


V  c  cos  a\  ,  3  ^ 

L  (X “*)  l  lsinaim^XE  Inu/ 

1  =0  3  =  0 

which  Is  the  usual  formula  for  propagating  the  covariance  of  the  noise. 


(2.22) 


(b)  Sampling  Error  Measure 


This  measure  is  defined  in  terms  of  the  isotropic  averaging  operator  of 
paragraph  (2.1) 

a.2„f=  m{(c,  -f“  £ )2}  =  M  {(Cn“  )a}  (2.23) 

or,  in  matrix  form 

Es  =  M-fe.el]-  =  M  {(c  -  Fz)(c  -  Fz)T}  (2>  24) 

=  C  -  2  C  c,  Ft  +  F  Czz  Ft 

where  E,  is  a  Na  x  Ne  matrix,  and  C  ,  C  0i  ,  CZ2  were  introduced  in 
paragraph  (2.2). 

(c)  Total  Error  Measure 


The  total  measure  is  the  sum  of  (a)  and  (b) 


+  o« 


or,  in  matrix  form, 


Er  =  Es  +  E^  =  C  -2C8ZFT  +  FCnFT  +  FD  FT  = 
C  -  2Caz  FT  +  F(Ct«  +  D)  Ft 


(2.  25) 


(  2.  2G) 


where  E  T  is  the  NpxN„  error  matrix  associated  with  F  and  with  the 
covariances  that  define  C  ,  Ccz  and  (CIZ  +  D).  Expression  (2.2G)  is  a 
special  case  of  the  formula  for  "E,  5"  in  least  squares  collocation  (for  instance, 
Moritz  (1978),  Ch.  3,  eqn,  (3.20));  moreover,  it  belongs  to  a  family  of 
formulas  also  found  in  the  minimum  variance  estimation  and  filtering  of  time 
series  and  of  processes  sampled  on  the  euclidean  plane. 

The  total  measure  has  been  chosen  simply  as  the  sum  of  as3n“ + 
by  making  the  basic  assumption  that  the  sampling  error  and  the  propagated 
noise  are  due  to  completely  independent  causes.  The  first  depends  on  the  values 
while  the  second  depends  on  the  measurement  errors  of  instruments 
that,  at  least  ideally,  operate  with  accuracies  unaffected  by  the  quantities 
measured,  or  in  such  way  that  any  Interactions  can  be  eliminated  by  simple 
corrections. 


The  columns  of  F  are  defined  by  the  quadrature  formula  used,  and 
such  formulas  either  satisfy,  or  tend  to  satisfy,  orthogonality  conditions  (para¬ 
graph  (1.  3  (c)).  For  this  reason,  provided  that  Czz  and  D  belong  to  the  type 
to  be  described  in  paragraph  (2. 9),  matrices  E^,  Er.  ,  and,  thus,  E T  ,  arc  cither 
diagonal  or  diagonal  dominant,  and  in  the  latter  case  tend  to  become  diagonal  as 
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the  sampling  intervals  decrease,  or  Np  ->  “.  For  this  reason  the  correlations 
among  the  erx’ors  for  individual  coefficients  are,  or  "tend  to  be",  very  small. 

The  diagonal  elements  of  the  error  matrices  Es  ,  E„  and  Er  are  the 
variances  of  the  errors  in  the  respective  coefficients,  as  defined  by  (2.20), 
(2.23),  and  (2. 25),  respectively. 


2.5  The  Meaning  of  the  Error  Measure 

The  treatment  of  the  propagated  noise  is  the  same  as  in  least  squares 

adjustment,  so  this  part  of  the  error  measure  should  be  easily  under¬ 
stood.  The  sampling  error  measure,  on  the  other  hand,  is  a  geomet¬ 
rical  measure;  M  {  }  belongs,  as  a  concept,  in  the  field  of  integral  geometry, 
or  the  study  of  "geometric  probabilities”.  This  is  a  branch  of  mathematics 
closely  related  to  integration  and  to  measure  theory,  and  also  to  statistical 
mechanics.  In  geodesy,  this  type  of  idea  is  relatively  new  (Kaula,  1959), 

Moritz  (1965),  but  it  has  been  used  already  extensively  enough  to  show  its 
considerable  worth. 

From  expressions  (1.10)  and  (1. 11),  the  covariance  and  the  power  spec¬ 
trum  are  functions  of  each  other.  Since  either  of  them,  and  the  sampling 
grid,  define  matrices  C  ,  C8Z  and  Czz  in  expression  (2.  24),  it  follows  that 
a  statement  on  af„,  is,  somehow,  also  a  statement  on  the  performance  M  F 
for  all  the  functions  that  have  the  same  power  spectrum  that  determines  the 
diagonal  elements  of  C  .  To  put  this  more  precisely,  consider  a  function  fx  (0,/\) 
having  the  given  power  spectrum.  If  C„®  were  estimated  for  fx  and 
also,  at  least  ideally,  for  all  its  rotations,  then  the  mean  square  of  the  sampl¬ 
ing  error  e>p  in  Ca®  for  all  this  functions  would  be,  by  definition  of  M  J  , 
the  measure  cr .  If  a  second  function  fa  (perhaps  not  a  rotation  of  f  t)  and 
all  its  rotations  were  then  analysed  In  the  same  way,  the  average  of  efD  for 

?  fy  ' 

all  these  functions  would  be,  once  more,  cr,  nl  ,  as  long  as  f3  has  the  same 
spectrum  as  fp  .  Moreover,  the  average  of  e^  for  fj  ,  f3  ,  and  their  ro¬ 
tations  put  together,  would  also  be  crfBB  .  In  fact,  if  we  had  a  finite  set  of 
functions  f*  ,  fa  ,  .  .  .  fB  ,  with  arbitrary  n  ,  all  with  the  same  power  spec¬ 
trum  (or  covariance),  then  cfp  would  average  crfa^  for  all  the  fi  and  their 
rotations. 

It  appears,  from  the  preceeding  discussion,  that  one  could  take  a  simple 
step  and  say  is  the  mean  of  the  sampling  error  squared  of  the  estima- 

tor  =  f  ®B  £  ,  over  all  possible  functions  with  the  given  power  spectrum.  " 
Unfortunately,  as  mentioned  in  the  Introduction,  the  sphere  is  a  rather 
wicked  surface.  There  is  a  theorem  by  Lauritzen  (1973)  that  states  the  im¬ 
possibility  of  having  the  same  average  crfBB  for  all  functions  as  for  evei'y 
function,  when  the  distribution  of  the  ensemble  happens  to  be  gaussian.  Moritz 
(1978)  has  endeavoured  to  show  that  this  is  no  problem  if  the  ensemble  of  functions 
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is  not  gauss ian,  hut  using  his  conclusions  here  would  force  the  introduction 
of  a  rather  strange  requii’emcnt  of  "non-gaussness"  on  the  ensemble  of  the 
signals  analysed  that  is  best  left  out,  if  possible. 

Perhaps  there  is  a  way  out  in  going  back  to  the  idea  of  a  finite  set  of  func¬ 
tions  f  t  ,  where  the  problem  does  not  exist,  by  saying: 

"the  error  measure  a3sn^ ,  for  a  certain  estimator  and  a  certain  type  of 
signal  power  spectrum,  is  the  average  of  the  square  of  the  sampling  error  in 
for  all  functions  with  the  given  power  spectrum  KVKK  TO  RJC  ANALYSED 
with  that  estimator,  and  for  all  their  rotations". 

After  all,  accuracy  is  what  geodesists  are  always  interested  in,  not 
perfection. 


2.6  Simple  Formulas  for  Area  Means 

The  numerical  studies  of  section  3  concentrate  in  area  mean  type  for¬ 
mulas,  because  area  means  are  preferred  for  collating  information, 
particularly  on  a  global  basis,  at  present.  The  formulas  to  be  studied  here 
and  in  that  section  can  be  divided  into  "simple"  and  "optimal".  The  name 
"simple"  is  given  here  to  expressions  of  the  type 
Aa  n-i  am  f  __ 

Cna  =  Mn  Z  l  ft,  L  Xi.  (^X)dcr  (2.27) 

)=Oj  =  0  J °t  3 


where  jxn  in  a  scale  factor  affecting  the  nth  harmonic  as  a  whole.  Ex¬ 
pressions  of  this  type  have  been  developed  more  or  less  intutitively,  along 
the  lines  of  the  following  reasoning: 


If  the  signal  were  constant  on  each  block,  it  will  equal  its  mean  value 
there,  and  the  coefficients  of  such  a  fuention  would  be  precisely 


1 

4  TT 


N~l 

I 

1  =  0 


(M)  dc 


(2.28) 


according  to  (1.4).  In  general,  most  signals  are  not  equal  to  their  mean  value 
over  whole  blocks,  so  the  expression  would  not  be  exact.  In  most  eases, 
the  signal  would  have  fluctations  in  each  block,  and  it  would  be  less  smooth 
than  a  function  that  is  constant  overreach  block,  so  using  the  formula  above 
withTii  as  data  may  result  jn,the  C®  of  a  smoothed  function.  As  a  refinement, 
one  could  try  to  do-smooth  the  Cn^  .  If  the  blocks  were  circular,  the  relation¬ 
ship  between  "true"  and  "smooth"  would  be 
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(2.  29) 


where  /Sn  is  known  as  the*  Pellinen  smoothing  factor  of  degree  n 
The  relationship  between  /3n  and  the  radius  of  the  circular  blocks  is  given  in 
paragraph  (4.3).  For  small  blocks,  experience  shows  that  there  is  little 
difference  between  the  area  means  of  geodetic  data  on  circular  or  on  square 
blocks,  so  the  error  is  small  if  one  assumes  that  they  are  the  same;  in 
such  case  the  modified  expression 

N-l  3N-1 


4nj3n 


i-o 


I 

i~  o 


/  V- “ 


Yna(0,X)dcr  =  c 


a 


(2.30) 


could  be  used;  in  practice,  this  is  only  an  approximation, though  a  good  one, 
as  showed  by  Katsambalos  (1979),  who  tested  this  expression  extensively. 


In  addition  to  (2.28)  and  (2.30),  Lowes  (1978)  has  proposed  using 


A 

C 


a 

na 


4ttV 


N-l  »  N— 1_  /.  (y 

l  l  U3  YnB(0,X)d a 
i  =  o  3  —  o  Jasj 


(2.31) 


to  estimate  the  harmonic  coefficients.  All  these  expressions  have  the  prop¬ 
erty  that,  because  -  1  ,  and  }  (0»X)  do  -Joi}  f(0,X)  Ya(0,X)do 

as  -  0  (or  Np  -  00 ),  it  is  true  that  the  error  e,P  =  c®  -  Cn^  -*  0  with 
NP  -*  00 ;  in  other  words:  egp  is  properly  called  a  sampling  error  in  the  sense 
given  to  this  term  in  paragraph  (2.3). 


Comparing  (2.28),  (2.30),  and  (2.31)  it  is  easy  to  see  that  they  all  belong 
to  a  class  of  express ionsof  the  form  (2.27),  with  ,  and 

M a  =  JTFg'a  ,  respectively. 


The  scaling  factor  /in  can  also  be  regarded  as  a  de-smoothing  factor,  if  one 
wishes  to  retain  the  intuitive  meaning  of  these  formulas.  In  the  notation  of  para¬ 
graph  (2.2),  these  expressions  can  be  written,  according  to  (2.27),  as 


nm 


a  T 

Madina)  31 


with 


Mnh 


a 

no 


(2.  32) 


Replacing  (2.32)  in  the  definition  of  the  sampling  error  measure,  (2.23), 
and  adding  with  respect  to  m  and  a  to  obtain  the  total  error  in  the  nth  harmonic: 

T  T  n a  =  °i?  “  f"2  Y.  T  — *  ilo*]  Mn  + 

Ci=z  O  a  —  o  La=0  a  =  o  J 

it  T  (1»V(C,,)  hfj  Mn2  (2.33) 

|p£—  o  o -o  J 
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r 


(where  z  is  a  row  of  Cci).  This  is  the  sum  of  certain  diagonal  ele¬ 

ments  of  Es  ,  according  to  (2.24),  when  the  estimator  has  the  form  (2.27). 
Clearly,  (2.  33)  is  a  quadratic  function  of  the  scalar  ,  and  as  such  it  can 
have  either  a  maximum  or  a  minimum.  If  Czz  is  positive  definite,  it  must  be  a 
a  minimum.  Finding  the  corresponding  value  of  pn  is  the  same  as  finding 
the  formula  of  type  (2.27)  that  has  the  smallest  sampling  error  per  harmonic 
for  signals  with  the  covariance  (power  spectrum)  specified  by  Czl  .  In  ad¬ 
dition  to  the  sampling  error,  the  measure  of  the  propagated  noise  can  be  added 
to  obtain 


I  I  "i?"  CrjM 

Z  I2 

a_=0 

?=0  ,=rO 

r  '  ”  ,  a  t 
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(2.34) 


This  is  also  quadratic  and  has  a  minimum,  and  finding  the  optimum  pn  is 
the  subject  of  the  next  paragraph. 


2.7  Optimum  de-Smooth ing  Factors 


The  coefficients  of  pn  and  p  n  in  (2.33)  and  (2.34)  are  both  real 
scalars,  and  so  is  the  independent  term  of  .  The  expressions  represent 
parabolas,  and  because  both  CzZ  and  D  are  positive,  if  the  further  (and 
likely)  assumption  is  made  that  they  are  also  definite,  then  b'!  V  V  >  0, 

Tut  <T  Vi¬ 


and  the  parabola  has  a  minimum  where  pn  satisfies  the  condition 

I  I  otZ  “  -I  Ic u,  h“  * 

«  »  a  i 


1  i. 

2  dp, 


X  (h?,)  C.j  hj, 


a 

Mn 


for  the  sampling  error  (2.  33 


i.e. ,  at 


or  at 


P  a  = 


* 

Pn 


^V'£.raOl,z  ]lnn 

l  J(h“)C„h^ 
\  X  £»»«,»  il«a 


(2.  35) 


(2. 36) 


for  the  total  error  (2.  34). 

Expressions  (2.  28),  (2.30),  (2.31),  and  (2.36)  will  be  studied  further, 
by  means  of  computed  examples,  in  section  3. 
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1 


2. 8  Least  Squares  Collocation 

In  the  notation  of  Paragraph  (2.2),  optimizing  /a a  is  the  same  as  ob¬ 
taining  the  optimal  vectors  _f  of  the  form 

.a 

for  the  estimator 

a  _ 

£  =  Fm 

where  m  =  £  +  n  ,  and  the  (f^1  are  the  rows  of  F  .  If  no  restriction  is 
placed  on  the  form  the  rows  of  F  can  take,  then  a  reasoning  similar  to 
that  in  the^ireceeding  paragraph  leads  to  the  best  possible  linear  estimator 
for  the  C„  . 


Considering  the  total  measure  of  error  for  0  sn^N  : 

I  i*-l  «  ~  I  N-l  a  n 

°?  -  E  E  £  <  =  E  - 2  III 

a=oa=o«=:o  a=o  a=o  »=o  *=o 


+ 


M-l 


I 

a=o  u=o«=o 


I  I  (f“)T(C„+D)il 


a 


(2.37) 


it  is  not  difficult  to  see  that,  because  all  afB  are  non-negative,  finding  the 
F  that  minimizes  their  sum  is  the  same  as  finding  the  F  that  minimizes 
them  individually.  The  sum  of  the  mean  squared  errors  of  all  coefficients  is 
the  trace  of  the  error  matrix  ET  of  (2.26)  : 

-  i  ?  t  <4a  -  tr[ET] 
ot=o»  =  °  *=° 

To  obtain  the  condition  for  a  minimum,  one  must  differentiate  (2.37)  so, 
according  to  (2.26), 

\  ff-  [Erl  =  -clt  +(C„  +  D)  Ft  (2.38) 


as  found  using  well-known  matrix  analysis  formulas. 

From  this  follows  that 

F  =  C,t  (C„  +D)'1  (2.39) 

is  the  F  that  minimizes  (2.37),  provided  that  (C„  +  D)  Is  positive  definite. 

As  already  explained,  both  matrices  are  always  positive  and  their  sum  is 
usually  definite.  The  expression  for  the  optimal  estimator  for  is 

=  (ffjT  m  =  (C ix  +  D  f1  m  (2.40) 
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where  (f“)T 

ft®. 


is  the  row  of  the  optimal  estimator  matrix  F  corresponding  to 


The  use  of  expression  (2. 40)  is,  in  brief,  least  squares  collocation 
applied  to  spherical  harmonic  analysis. 

When  the  optimal  F  is  used,  the  error  matrix  becomes,  according 
to  (2.26)  and  (2.39), 

Et  -  C  -  Co*  (Cjj  +  D)  Cox  (2. 41) 

and  the  total  error  measure  is  the  trace  of  this  rratrix:  by  definition,  the 
smallest  for  all  possible  F. 

Clearly,  whether  one  is  interested  in  estimating  coefficients  or  in 
determining  the  likely  accuracy  of  such  estimates,,  using  expressions  (2.40) 
or  (2. 41)  require  a  knowledge  of  either  (Ctx  +  D)"1  (inversion)  or,  at  least, 
of  C3  x  (CX1  +  D)-1  (solution).  Because  the  matrix  (Cxr  +  D)  has  dimension 
xNs  ,  obtaining  either  requite  s,  by  usual  linear  algebra  methods,  O(Np) 

(or  0(N 6 )), operations.  In  the  case  of  a  1°  x  l°grid,  N„  =  64800,  so,  at  some 
200000  products  and  sums  (double  precision)  per  second,  a  modem  computer 
like  the  one  at  OSU  would  need  about  one  century  to  obtain  all  6?,  to  degree 
and  order  180  from  data  on  such  a  grid.  Fortunately,  as  explained  in  para¬ 
graph  (2.9),  if  the  covariance  functions  of  signal  and  noise  both  satisfy  certain 
conditions,  and  if  AX  is  constant  for  the  whole  grid,  then  both  Cjx  and  D 
(consequently  their  sum)  can  be  inverted  in  much  fewer  operations  than  by  con¬ 
ventional  methods,  because  they  possess  a  particularly  strong  structure.  More¬ 
over,  the  optimal  estimator  C  ®  m  turns  out  to  be  of  the  form  (1. 24) 

or  (1. 7),  depending  on  the  kind  of  data  m ,  so,  under  rather  general  conditions, 
the  optimal  estimator  of  Is  also  the  best  quadratures  type  formula  for 
point  data  or  for  area  means,  as  the  case  .may  be. 

The  conditions  mentioned  above  are  satisfied,  for  instance,  when  both 
ihe. geometrical  covariance  cov(f(P),f(Q))  and  the  stochastic  covariance 
E  ^nijOint  j  (P  s(0ltXj)  ,  Q  =(9it,>*)  are  Isotropic,  i.e. ,  functions  only  of 
the  separation  between  the  points  p  and  Q  .  By  definition  of  M  i  j ,  the 
geometrical  covariance  obtained  using  this  operator  is  isotropic,  so  C,z  has 
the  desired  structure.  A  common  assumption  regarding  good  instruments 
is  that  the  nl3  are  uncorrelated,  so  D  Is  diagonal.  If  the  errors  are  sta- 
tionary,  so  their  variances  are  constant,  or  at  least  constant  along  parallels, 
then  matrix  D  has  the  required  structure,  and  inverting  C*x  +  D  can  be 
greatly  expedited.  In  practice,  however,  this  Is  not  likely  to  be  the  case, 
as  the  number  and  quality  of  measurements  will  vary  from  region  to  region, 
resulting  indifferent  cfj  both  globally  and  along  parallels.  As  a  result,  the 
best  linear  estimator  in  terms  of  the  chosen  error  measure  will  not  have  the 


-37- 


quadratures  form,  and  it  will  be  very  difficult  to  compute  whenthe  number  of 
data  values  is  very  large.  Nevertheless,  as  shown  in  section  3,  quadrature 
formulas  caa  give  reasonable  estimates  of  the  Ca^  with  noisy  data  sets  where 
the  noise  is  uneven,  so  it  would  be  interesting  to  get  the  best  quadrature  for¬ 
mula  for  a  particular  combination  of  signal  and  noise,  provided  that  such  a 
formula  can  be  obtained  without  undue  effort. 


2.9  The  Best  Quadrature  Formula  for  non-Unlform,  Uncorrelated  Noise 

If  the  variance  of  the  noise  fluctuates  along  parallels,  matrix  D  ,  though 
diagonal,  is  such  that  the  minimization  of  the  error  measure  (2.37) 

-  tr  [C  -  2C„  F T  +  F(C„  +  D)  FT  ]  «  #  (D)  (2. 42) 

(see  also  (2.26)) 

can  be  very  difficult  with  large  Np  ,  and  the  optimal  estimator  is  not  of  the 
quadrature  type.  Introducing  a  "modified  noise  matrix"  L  ,  also  diagonal 
and  where  the  diagonal  elements  are 

3N-1 

(2. 43) 


la  =  gj  [  o?j  (with  k  =  2N  i+j) 


the  following  modified  error  measure  4(L)  can  be  defined: 

4(L)  =  tr  [C  -  2C„  FT  +  F(C„  +  L)  FT] 

The  optimal  estimator  for  this  measure  is  easy  to  obtain,  and  is  of  the 
quadratures  type. 


(2.44) 


The  parts  of  4  (D)  and  4  (L)  that  measure  the  sampling  errors  are 
identical,  so  any  difference  between  the  overall  measures  must  come  from 
the  "noise  propagation"  parts  tr  [FDFr]  and  tr  [FLFT]  .  If  the  estimator 
(not  necessarily  optimal)  happens  to  be  of  the  quadratures  type,  i.e. ,  for 
point  data: 

=  Y  Y  x!'  {sto}  +  A|]  (2*45) 

1*0  4  =  0 

then  the  propagated  noise  is,  assumming  the  n  to  be  uncorrelated, 

-  X  X  «■••>■  fcft 

IN-4*  N-l  D  N-l  SN-a 

tyD)  =tr[FDFT]  •£  £  f  t  Z  (X?-)3  £ 

'  1*0=0  '  B  =  0  ■= 0  1  =  0  1  =  0  (2. 


so 


(2.46) 


The  "modified  noise','  on  the  other  hand,  is,  according  to  (2. 43)  and  (2.44): 


r 


h  s  u  a*“i 

$JL)  =tr  [FLFt]  =  I  L  I  (XiB)a  I  (cosam  j  AX  +  sin2mjAX) 

^  «  =  0»  =  °  1 = 0  3  =  0 


3N-1 


*  E>,-t  £  7WT 

*NJ  =  °  b=0»  =  0  1=0  3=0 


(2.47) 


Comparing  the  expressions  for  [FLFT]  and  for  [FDFT]  ,  it  follows 
that  they  are  Identical,  and  since  the  "sampling"  parts  are  also  identical  in 
(2.42)  and  (2.44),  then 


tr  (C  -  2CciFt  +  F(C,t  +  L)  FT]  =  tr[C  -  2Cai FT  +  F  (C„+D)FT] 

(2.48) 

This  means  that  the  actual  and  the  modified  error  measures  must 
coincide  if  the  estimator  is  of  the  quadratures  type. 


Replacing  D  with  L  in  equation  (2.  38)  and  solving  for  the  estimator 
matrix,  one  gets 

Ft  «  Ca*(Clt  +  L)”1  (2.49) 

where  Ft.  is  the  estimator  matrix  that  minimizes  the  modified  error  measure 
(2.44).  Because  of  the  way  L  has  been  defined,  this  estimator  is  of  the 
quadratures  type,  so  the  modified  and  the  actual  error  measures  coincide,  as 
just  shown. 

Assume  that  there  is  an  estimator,  different  from  c  =  FL  m  but  also 
of  the  quadratures  type,  the  estimator  matrix  of  which  is  F,  and  such  that: 

tr  [C  -  2C„  FT  +  F(C„  +  D)FT]  <tr  [C  -  2C0I  Fj  +  FL(C„  +  D)  FLT  ] 


Then,  according  to  (2.48), 

tr  [C  -  2Csz  FT  +  F  (C„  +  L)  FT  ]  <tr  [C  -  2CM  F[  +  Fu  (Czz  +  L)  f[  ] 

(2.  50) 

which  contradicts  the  fact  that  Fl  minimizes  the  modified  error  measure. 
Therefore,  (2,50)  cannot  be  trie ,  and  FL  must  be  the  matrix  of  the  optimal 
quadratures  type  estimator  that  minimizes  the  actual  error  measure  o2  . 

The  optimal  quadratures  type  estimator,  as  the  name  indicates,  is  the 
best  of  a  certain  kind,  not  the  absolute  best.  The  best  estimator,  when  no 
conditions  as  to  its  form  are  imposed,  will  not  be  (in  general)  of  the  quadra¬ 
tures  type,  unless  D  happens  to  have  the  "right  form"  specified  before, 
i.e. ,  unless  D  =  L. 

When  D  =  L  ,  the  optimal  estimator  and  the  best  quadrature  formula 
coincide.  Regardless  of  this,  the  quadrature  formula  obtained  from  (2. 49) 
is  the  best,  so  its  error  measure  is  a  lower  bound  for  those  of  all  other 
quadratures  formulas  with  the  given  signal  and  noise. 
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When  D  /L,  while  minimizing  the  sum  of  all  error  variances  i.e., 

tr  (h>),  the  optimal  quadrature  formula  does  not  minimize  each  individual 
variance  <"r3  .  Tn  slmw  fhls.  onnsidcr  thr>  nmnap'ated  noise  measure  for 


variance  aBa  .  To  show  this,  consider  the  propagated  noise  measure  for 
when  the  n4l  are  uncorrelated  (to  simplify  the  argument): 

<r^-  =  £  (x?f  r  [sto}  mjAX  o?j  =  i  (xf)2i  r  + 

1  1=0  J=o  L  J  1=0  J=0  i  = 

The  modified  error  measure,  minimized  by  the  formula,  is 

*j-l  »N-1  r  ,2^  3N-1  N-l  3N-1 

a  I  of,  =  l  (x!')2  it  of, 

1=0  3=0  °  ^^3-0  1  =  0  3  =  0 

Clearly,  both  are  not  the  same,  unless 


IW 


2mjAX(T1,  =  0 


which  is  not  likely  to  be  fulfilled  for  arbitrary  a*  .  However,  looking  at  the 
reasoning  which  leads  to  (2.47) ,  one  can  see  that  the  sums  of  the  modified  and 
the  actual  error  measures  for  pairs  (Cna  ,  SnB),  and  also  for  the  individual  C^0  , 
are  already  identical.  From  this  follows  that  the  variance  of  the  error  per  degree 


i  t  <a  - 


a-oB=0 

and  per  average  coefficient  per  degree: 


6<f„  - 


2n  +  1 


(2.51) 


(2.  52) 


are  also  identical  to  the  modified  measure.  So,  while  nothing  can  be 
predicated  of  individual  coefficients,  the  error  for  each  harmonic  as  a  whole 
and  that  for  the  "average  coefficient"  in  it  are  going  to  be  minimum.  By  Parseval's 
identity  (1.12),  if  the  coefficients  were  used  to  calculate,  say,  geoidal  undula¬ 
tions,  the  mean  squared  error  of  the  computed  geo  id,  globally,  would  be  tie  same 
as  the  sum  of  the  error  squared  of  the  normalized  coefficients,  so  individual 
coefficient  variances  are  of  little  interest  in  this  and  similar  applications, while 
the  afn  are  very  important.  This  shows  that  the  optimal  quadratures  formula 
when  D  /  L  can  be  just  as  useful  as  when  D  =  L. 

The  discussion  in  this  paragraph  lias  been  centered  on  point  value  type 
formulas;  the  conclusions  apply  equally  well  to  area  mean  type  formulas,  the 
extension  of  the  reasoning  being  quite  straightforward. 


2. 10  The  Structure  of  the  Covariance  Matrix  and  its  Consequences 

The  following  discussion  summarizes  some  results  presented  by  this 
author  in  a  previous  report  (Colombo,  1979a).  In  order  to  be  able  to  calculate 
the  variance  of  the  error  cr®®  with  expressions  such  as  (2.26),  and  also  to  be 
able  to  obtain  the  optimal  estimator  according  to  collocation  theory,  it  is  neces¬ 
sary  to  create  and  invert  the  Np  x  Np  matrix  (C„  +  D),  which  can  be  very 
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large  if  the  number  of  data  Np  is  large.  In  the  case  of  regularly  sampled  data 
this  two  problems  can  be  greatly  simplified  if  the  covariances  and  the  grid 
has  certain  symmetries.  The  most  important  of  these  are-  (a)  the  sampling 

mjongitudg  must  be  at  constant  interval  and  along  parallels  (or  parallel  bands, 
I.e. ,  rows  of  blocks) ;  (b)  forgiven  i  and  p  the  covariances.  cov(u(0i  ,A,), 
vWp»>q))  (or  cov^,  vPq))  and  Ejn„,  np,},  must  depend 

only  on  Ij-ql  .  It  is  also  very  advantageous,  though  not  essential,  that  the 

grid  be  symmetrical  with  respect  to  the  Equator. 

In  what  follows  Nr  is  the  number  of  parallels  and  N1  the  number  of  meridians 
(Nr  =  N,  N1  =  2N  when  the  grid  is  equal  angular). 


Under  this  set  of  conditions,  if  the  data  vector  m  is  ordered  according 
to  (2. 8)  and  is  subdivided  into  partitions  nq  ,  where 

=  [m,0  mtl .  .  . 

includes  all  data  values  in  the  same  parallel  or  row  of  blocks,  then  the 
matrix  (Czt+D)  can  be  partitioned  into  blocks  Clp  ,  each  of  dimension 
N1  x  N1  ,  containing  the  covariances  between  the  data  along  rows  i  and  p  . 


Each  block  Clp  has  a  Toeplitz  circulant  structure,  because  its  elements 
satisfy  the  relationships 

ciq  =  c  wq+i  5  ® 3 o  =  cri  Ni“i  when  j  >  0 


which  follow  from  the  fact  that  parallels  are  circular,  and  that  the  covariance 
between  points  in  parallels  i  and  p  is  a  function  of  |j-q|  .  Moreover, 
the  elements  in  the  first  row  or  column  (the  CIp  are  symmetrical)  also 
satisfy 

Co,  ~  co'm  when  q  >  0 

Therefore,  the  first  row  can  be  represented  exactly  as  a  sum  of  iNl  +  1 
cosines: 

=  Z  a-P  cosmjjj-  q  (2.53) 

■  O 

The  alBp  form  the  discrete  Fourier  transform  of  the  sequence 


c&. 


>  >r 

■'O  i? 


c 


ip 

0  Mi—  1 
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If 


r1,5  =  Hai?  (2.54) 

where  H  =  |N1  if  m  =  0 

'■Nl  if  m  ±  0 

T 

If  R(m)  is  the  matrix  where  each  "ip"  element  equals  r*,p  ,  then  ( as 
explained  by  Colombo, (op.  clt. ))  Inverting  (C2I  +  D)  is  equivalent  to  in¬ 
verting  the  Nr  x  Nr  matrices  R(m)  for  m  =  0,  1,  .  .  .  -|N1 
Isotropic  covariances  satisfy  the  "  I]  -  ql  condition"  mentioned  above,  so, 
for  fixed  AX  ,  the  covariance  matrix  always  has  this  regular  structure. 


Let 


2-“  -  iQ  {,“} mix-  •  •  •{r}m<Ni-i)ix'r 

A  vector  of  the  type 


a 


=  [v0  c 


at 

o  ilm  » 


ar 

vi  £.«  .  • 


•  VNr_x  c“T] 


(2. 55) 


shall  be  called,  for  convenience,  a  vector  "of  frequency  m". 

Under  the  conditions  described  before  all  the  eigenvectors  of  (C„  +  D) 
are  vectors  of  frequency  m,  with  m^O,  1,  .  .  .  iNl  .  Moreover,  if 
XtB  (f  =  1»  2,  .  .  .  Nr)  Is  one  of  the  Nr  eigenvalues  of  R(m),  and  if 

It.  =  [s1/.  .  .  s^U  ]T 


is  the  corresponding  eigenvector  of  R(m),  then  Xt>  is  also  an  eigenvalue  of 
(Ct*  +  D),  and  the  pair 


is; 


tm  C«T 
o  x*  ’ 


T 


the  two  corresponding  eigenvectors  of  (C„  +  D).  Therefore,  to  decompose  the 
large  covariance  matrix  In  eigenvectors  and  eigenvalues  is  equivalent  to 
decomposing  the  \  Nl  +  1  matrices  R(m)  ,  and  this  is  why  the  latter  are  rele¬ 
vant  to  the  inversion  of  (CM  +  D):  the  eigenvalues  of  the  inverse  are  the 
reciprocal  of  the  Xts  ,  while  its  eigenvectors  are  the  same  as  the  .  Further, 
this  implies  that  (Ctl  +D)-1  has  the  same  structure  as  the  covariance  matrix, 
l.e. ,  it  consists  of  Toeplltz -c irculant  blocks. 

Since  (C„  +  D)-1  has  eigenvectors  of  frequency  m,  then,  if  h  is 
a  linear  combination  of  vectors  of  a  given  frequency,  z  =  (C  „  +  D)_1  h  is 
also  a  linear  combination  of  vectors  of  that  frequency.  In  the  case  of  point 
data,  from  expression  (2.6)  follows  that  the  cross-covariances  vector  in 
(2.40)  is 
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T  OS  ..GO  -  r,  at 

£«»«*  2n~TT  lp»(cose0)  CB  •  .  .  p„  (cos0Mr.i)  c,  ] 

=  [ k  o“  c“r  .  .  .  k^._,  C?) 


Define 


kn"  =  [kS“ .  .  .  ksr-i]' 


(2.56) 


ns  r  no  ns 

i.  L  Xo  •  •  •  Xn  r— 1  i 

*  a  , 

Then  fnB  =  (Cz2  +  D)  c_ ,, .. a, ti  must  be  of  the  form 
fa  _  ,  n»  aT  t*  odJ 

_i  n  b  "  f  Xo  2.  r  •  •  •  Xm  r— l  2.  n  J 
where,  according  to  Colombo  (ibid). 

~  l  ,  n  a 

X  =  K(m)  k 


(2.57) 


(2.58) 


Similarly,  for  area  means, 

3  re^.d_  rx^ aa  t 

£bb0>i  -  - l-  .  •  /  P0»  (cos6)  sinOd  0/  c“tlX.  ..] 

(2n  +  1)A*i  Je  i  Jx, 

=  “I  [.  .  .  f’*4L(cose>Stoede(/A(ra>U  JB<m)  |  <*)'•.•]'  ^ 

(2n  +  1)  A  jj  J0i  001  VNj  t  A(m)f  “7 

(where  A  is  supposed  to  be  independent  of  j  >  according  to  expressions  (1.  7) 
and  (2.  7),  so 

r  „»/  fA(m)|  o  ,  jB(m)  }  x  \T  T 

ln0  !•  •  •  Xt  ^  £s  +  |  A(m|  -BJ  *  *  *  1 

In  conclusion,  the  optimal  estimator  for  point  values  has  the  form 

£?■  =  (l“)T£i  “Y  T  xr  )s3  (2.60) 

t=o  3  =0  1  > 

while  that  for  area  means  is  of  the  type 

42  -  (If.)’  m  =T  Y  xr  I  cosmjiX  ♦  )»<'">)  stamJAXlm,, 

1=03-.  0  t”'  <)  l  AtmJ  /2.6I) 


|B|m)  \  sin  m  j  AX]  m1>t 
V  APn)  (2.61) 


so  they  are  both  of  the  Quadratures  kind,  as  anticipated  in  the  proceeding  para¬ 
graph. 


2.11  Setting  up  and  Inverting  the  Covariance  Matrix 

Each  block  C1r  of  (Cl2  +  Dt  is  wholly  determined  by  the  1st  ?  N1  +  1 
elements  in  its  fir?t  row;  if  the  number  of  operations  required  to  compute  any 

-42- 


r 


element  of  C*5  is  k  ,  then  only  $N1  +  1)  k  operations  are  needed  per  block, 
instead  of  nA  ,  as  would  be  the  case  if  C *  did  not  have  the  Toeplitz  structure 
described  previously.  This  is  a  reduction  of  the  number  of  operations  by  a 
factor  of  2N1  ,  and  clearly  applies  not  only  to  C1®  but  to  the  whole  covariance 
matrix  as  well.  So  (C 1 1+  D)  can  be  set  up  about  2N1  times  faster  than  an 
ordinary  matrix  of  the  same  size. 


The  total  number  of  elements  to  be  computed  is  -|N1  x  Nr3",  or  N3  in  the 
case  of  equal  angular  grids.  If  the  grid  is  a  fine  one,  this  can  still  be  a  very 
large  number  of  covariances.  This  is  particularly  serious  in  the  case  of  area 
means,  because  the  area  mean  covariances  are  given  by  expressions  of  Ihe  form 


cov  (U{  j ,  u^,, )  =  M 

cov(u(0,A),  u(8',\'))  dodo” 


j  fudcr /uda  ,  ~  f  f  M  ju(0,A)  u(6',  \')j 

J  Op,  Joti  Jo#  1  J 


dada' 
(2. 62) 


involving  double  area  integrals  of  the  covariance  function.  Numerical  quad¬ 
ratures  methods,  such  as  the  one  described  in  paragraph  (4.3),  have  been 
used  in  the  past  to  obtain  cov(ut. ,  u^,)  (see,  for  example,  Rapp  (1977)). 
These  methods  take  so  much  time  in  the  case  of  fine  equal  angular  grids 
for  instance,  that  it  may  be  practically  impossible  to  use  them  to  set  up  the 
covariance  matrix  of  a  global  data  set,  in  spite  of  the  reduction  by  2N1  in 
the  number  of  operations.  Fortunately,  the  coefficients  a*p  in  the  Fourier 
expansion  of  the  elements 


c 


ip  _ 
Ji 


cov(u1JfuM)  +  E 


(expression  (2. 53))  can  be  obtained  by  means  of  a  series  expansion  (truncated 
to  a  conveniently  high  degree  N%x)  according  to  expression  (4. 14)  in  paragraph 
(4. 1).  These  coefficients  are 


where 


I  i 


K  Nnax 

Ia,8Nh+»tf  +  £  Y.  In»aNh-i,i 
Hal  s  =  9 


In»3Nh_«,p  F(m) 


1 

F(m) 


Off  A  0 

,  <oos9>  sm9d6vfrh 

AX3  if  m  =  0 
(2  m"3)  (1-cosmAX) 


_ 1 _ 

AX(cos  8j  -  cos(8j +A0)) 

and  (2h  +  1)  NsN,tJ 


A  similar  reasoning  to  that  for  area  means  leads  to  an  analogous  formula  for 
point  values : 

K  Naax  K  N wax  ^ 

a«  =  [  I  Z  (^«3Nh+  ■  (cos 0j )  PaaMh»»  (cos ^))  +  z  L  (P.aNh-«  (cosSj)  . 
s=0  n=«  -  4=0.  =  «  (2.64) 

•P...W  <«>■*») 
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The  importance  of  (  A  63)  and  (2.  64)  is  that,  if  the  signal  and  noise  are  such 
that  the  number  of  terms  in  the  summations  is  not  too  large  (Ns ,  *  is  a 
"manageable"  number*,  they  allow  the  direct  determination  of  the  elements 
of  the  R(m)  matrices  according  to  (2.54).  In  this  way,  the  R(m)  can  be 
created  without  first  having  to  set  up  the  whole  covariance  matrix  and  then  to 
obtain  the  discrete  Fourier  transform  of  the  first  row  of  each  C Ip  .  This 
advantage  further  increases  in  the  case  when  the  grid  is  symmetrical  with 
respect  to  the  equator,  a  situation  that  applies  to  all  equal  angular  grids. 

Then  each  R(m)  is  persyrametrical,  i.e.,  symmetrical  with  respect  both 
to  the  main  diagonal  and  the  main  antidiagonal,  provided  D  is  also  persvm- 
metrical  (for  instance,  uniform  noise).  This  means  that  only  approximately 
-g-N3  elements  in  each  R(m)  are  different  and  have  to  be  calculated  indb* 
vidually . 

Having  set  up  the  R(m)  without  first  creating  the  covariance  matrix, 
the  inverse  of  (C*  *  +  D)  can  be  found  by  the  equivalent  operation  of  obtaining 
all  R/m)”1  .  The  number  of  operations  in  a  matrix  inversion  is  usually 
0(dimenslon3),  or  0(Ne,i  for  a  covariance  matrix  of  an  equal  angular  data 
set.  The  number  of  operations  per  R(m)  is  0(N?),  or  0(N3)  for  the  equal 
angular  grid.  In  fact,  as  explained  in  (Colombo,  1979a),  the  inversion  of  a 
persymmetrical  R(m)  is  equivalent  to  that  of  two  matrices  of  half  its  dimen¬ 
sion,  one  related  to  vectors  of  frequency  m  of  the  cosine  type,  and  the  other 
to  vectors  of  the  same  frequency  of  the  sine  type.  This  further  reduces  calcu¬ 
lation  by  a  factor  of  .  With  0(N)  R(m)  matrices  to  be  inverted, 
the  total  comes  to  OfN4)  operations,  or  0(NS)  times  less  than  for  the  in  ¬ 
version  of  (C„  +  D)  by  ordinary  techniques  (Choleskii  factorization,  Gauss- 
Jordan  elimination,  etc.).  0(N3)  is  also  the  order  of  the  number  of  data 
points  in  the  grid,  so  in  the  case  of  a  1°  x  1°  equal  angular  grid  with  64800 
elements  the  reduction  in  computing  time  is  0(64800). 

The  numerical  examples  in  section  3  all  involve  5C  x  5°  data  sets  with 
2592  elements,  so  (CK  +  D)  Is  of  dimension  2592.  Settingup  and  inverting 
such  a  matrix  is  a  large  exercise,  even  with  a  modern  digital  computer  such 
as  the  AMDHAL  470  at  Ohio  State,  unless  the  matrix  has  a  strong  structure 
that  can  be  exploited  to  simplify  the  work.  As  such  is  indeed  the  case  here, 
the  subroutine  NORMAL  described  in  Appendix  B  has  been  able  to  do  the  "’hole 
setting  up  and  inversion  in  only  20  seconds. 

The  inversion  of  (C- +  D)  requires  ©(dimension3)  operations  (0(N4)) 
instead  of  0(dlmension3)  because  of  che  Toeplitz  clreulant  structure  of  the 
C55  blocks.  This  "©(dimension3)"  property  is  common  to  other  algorithms 
for  inverting  Toeplitz-type  matrices,  such  as  the  famous  Trench  algorithm 
(Trench,  1965),  and  the  Justice  algorithm  (Justice,  1977),  the  first  for  data 
sampled  on  the  real  line  and  the  second  for  data  sampled  on  the  plane,  So, 
in  spite  of  its  "rather  wicked"  nature,  the  sphere  allows  this  very  convenient 
property  of  regular  grids  to  apply  also  on  its  surface.  In  fact,  not  only  on  the 
sphere,  but  also  on  any  oody  of  revolution  (cone,  oblate  and  prolate  spheroid, 
hyperboloids  and  parabotoids  cf  revolution,  etc.,)  rerJar  sampling  ana 
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covariances  that  satisfy  the  ”  |j-q|  condition"  will  result  in  covariance  matrices 
of  the  type  described  here,  and  this  is  also  true  of  other  matrices  based  on 
symmetrical  kernels,  such  as  the  normal  matrices  of  point  mass  models, 
w^en  the  points  belong  to  a  regular  grid,  etc.  Finally,  the  optimal  estimator 
CnB  =  n}  for  this  type  of  covariance  matrix  is,  as  shown  in  the  previous 
paragraph,  of  the  quadratures  type,  so  the  optimal  can  be  obtained  using 
the  same  efficient  algorithms  described  in  paragraphs  (1.5)-(1.7).  Altogether, 
the  powerfiil  structure  of  the  covariance  matrix  for  regular  global  data  sets 
is  most  remarkable.  One  of  its  many  advantageous  features  is  that,  because 
the  creation  and  invertion  of  each  R(m)  can  be  done  quite  independently  from 
those  of  the  others,  the  algorithms  developed  for  this  type  of  matrices  are 
eminently  suited  for  implementation  in  parallel  processing  computers. 

The  separation  of  the  algorithm  according  to  orders  also  means  that, 
although  setting  up  and  inverting  all  the  R(m)  may  require  a  large  number  of 
o()erations,  only  a  fraction  of  those  actually  correspond  to  the  recovery  of  the 
of  any  given  m,  so  the  numerical  errors  due  to  rounding  or  truncation 
are  not  likely  to  accumulate  to  any  great  extent  in  the  results. 


2.12  Optimal  Formulas  for  non-Uniform,  Correlated  Noise 

Irregular  noise,  already  discussed  in  paragraph  (2.9),  may  be  due  not 
only  to  the  varying  quality  of  the  measurements,  but  also  to  the  way  the  data 
is  "grided",  L  e. ,  the  way  the  value  attributed  to  a  node  (or  block)  ij  is 
obtained  by  interpolation  from  actual  measurements  nearby,  as  usually  data  is 
not  sampled  regularly  on  a  global  basis.  As  the  number,  disposition,  and 
quality  of  the  measurements  used  will  vary  from  point  to  point  in  the  grid,  so 
will  the  accuracy  of  the  interpolated  values.  Furthermore,  even  if  the  measure¬ 
ments  themselves  are  not  correlated,  the  grided  values  may  be  correlated 
because  some  of  the  data  may  be  used  for  more  than  one  interpolated  value. 

This  brings  about  the  question  of  what  can  be  done  when  D  is  neither  diagonal, 
nor  are  the  Dlr  blocks  in  D  ,  corresponding  to  the  C blocks  in  C,*  ,  all 
Toeplitz  circulant.  The  answer  is  a  simple  extension  cf  the  results  already 
obtained  for  Ihe  uncorrelated  case. 

When  the  noise  is  both  non -stationary  and  correlated,  replacing  the  co- 
variances  E  {n14  Ur, }  with 

=  455  <E  {n”  )  +  E  {nn  }  ))  .  where  h  =  J  -  s, 

will  result  in  a  modified  "noise  matrix"  L  where  the  (corresponding  to 
the  D  and  the  C*)  will  be  all  Toeplitz  circulant,  because  the  "covariance" 
cr^ ^  satisfies  the  condition  that,  for  a  given  i  and  r  ,  it  is  a  function  of  |j-s| 
alone.  The  optimal  estimator  for  the  modified  measure 

$(L)  =  trfC  -  2 C«(  FT  +  F(Ct,  +  L)  f} 
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The  importance  of  (2.  63)  and  (2.  64)  is  that,  If  the  signal  and  noise  are  such 
that  the  number  of  terms  in  the  summations  is  not  too  large  (Ns,  *  is  a 
"manageable”  number},  they  allow  the  direct  determination  of  the  elements 
of  the  R(m)  matrices  according  to  (2.54).  In  this  way,  the  R(m)  can  be 
created  without  first  having  to  set  up  the  whole  covariance  matrix  and  then  to 
obtain  the  discrete  Fourier  transform  of  the  first  row  of  each  C1'  .  This 
advantage  further  increases  in  the  case  when  the  grid  is  symmetrical  with 
respect  to  the  equator,  a  situation  that  applies  to  all  equal  angular  grids. 

Then  each  R(m)  is  persymmetrical,  i.e. ,  symmetrical  with  respect  both 
to  the  main  diagonal  and  the  main  antidiagonal,  provided  D  is  also  persym- 
metrical  (for  instance,  uniform  noise).  This  means  that  only  approximately 
-^-Na  elements  in  each  R(m)  are  different  and  have  to  be  calculated  Lndi- 
vidually . 

Having  set  up  the  R(m)  without  first  creating  the  covariance  matrix, 
the  inverse  of  (Cz*  +  D)  can  be  found  by  the  equivalent  operation  of  obtaining 
all  R/m)”1  .  The  number  of  operations  in  a  matrix  inversion  is  usually 
0(dimension3),  or  0(Ne}  for  a  covariance  matrix  of  an  equal  angular  data 
set.  The  number  of  operations  per  R(m)  is  0(N?),  or  0(N3)  for  the  equal 
angular  grid.  In  fact,  as  explained  in  (Colombo,  1979a),  the  Inversion  of  a 
persymmetrical  R(m)  Is  equivalent  to  that  of  two  matrices  of  half  its  dimen¬ 
sion,  one  related  to  vectors  of  frequency  m  of  the  cosine  type,  and  the  other 
to  vectors  of  the  same  frequency  of  the  sine  type.  This  further  reduces  calcu¬ 
lation  by  a  factor  of  .  With  O(N)  R(m)  matrices  to  be  Inverted, 
the  total  comes  to  0(N*)  operations,  or  0(Na)  times  less  than  for  the  in¬ 
version  of  (Czz  +  D)  by  ordinary  techniques  (Choleskii  factorization,  Gauss- 
Jordan  elimination,  etc.).  0(Na)  is  also  the  order  of  the  number  of  data 
points  in  the  grid,  so  in  the  case  of  a  1°  x  1°  equal  angular  grid  with  64800 
elements  the  reduction  in  computing  time  is  0(64800). 

The  numerical  examples  In  section  3  all  involve  5°  x  5°  data  sets  with 
2592  elements,  so  (CIZ  +  D)  is  of  dimension  2592.  Settingup  and  inverting 
such  a  matrix  is  a  large  exercise,  even  with  a  modem  digital  computer  such 
as  the  AMDHAL  470  at  Ohio  State,  unless  the  matrix  has  a  strong  structure 
that  can  be  exploited  to  simplify  the  work.  As  such  is  indeed  the  case  here, 
the  subroutine  NORMAL  described  in  Appendix  B  has  been  able  to  do  the  "’hole 
setting  up  and  Inversion  in  only  20  seconds. 

The  inversion  of  (C-,.  +  D)  requires  (^dimension3)  operations  (0(N4)) 
instead  of  0(dimtr.sion3)  because  of  ihe  Toeplitz  clrcuiant  structure  of  the 
C*5  blocks.  This  "0(dimensiona)"  property  is  common  to  other  algorithms 
for  inverting  Toeplltz-type  matrices,  such  as  the  famous  Trench  algorithm 
(Trench,  1965),  and  the  Justice  algorithm  (Justice,  1977),  the  first  for  data 
sampled  on  the  real  line  and  the  second  for  data  sampled  on  the  plane,  So, 
in  spite  of  its  "rather  wicked"  nature,  the  sphere  allows  this  very  convenient 
property  of  regular  grids  to  apply  also  on  its  surface.  In  fact,  not  only  on  the 
sphere,  but  also  on  any  oody  of  revolution  (cone,  oblate  and  prolate  spheroid, 
hyjierboloids  and  parabo'oid.°  of  revolution,  etc.,)  rer.lar  sampling  ana 
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covariances  that  satisfy  the  "  j j— q|  condition"  will  result  in  covariance  matrices 
of  the  type  described  here,  and  this  is  also  true  of  other  matrices  based  on 
symmetrical  kernels,  such  as  the  normal  matrices  of  point  mass  models, 
wj^en  the  points  belong  to  a  regular  grid,  etc.  Finally,  the  optimal  estimator 
Ca>  =  J^am  ®  for  this  type  of  covariance  matrix  is,  as  shown  in  the  previous 
paragraph,  of  the  quadratures  type,  so  the  optimal  6^  can  be  obtained  using 
the  same  efficient  algorithms  described  in  paragraphs  (1.5)-(1.7).  Altogether, 
the  powerful  structure  of  the  covariance  matrix  for  regular  global  data  sets 
is  most  remarkable.  One  of  its  many  advantageous  features  is  that,  because 
the  creation  and  invertion  of  each  R(m)  can  be  done  quite  independently  from 
those  of  the  others,  the  algorithms  developed  for  this  type  of  matrices  are 
eminently  suited  for  implementation  in  parallel  processing  computers. 

The  separation  of  the  algorithm  according  to  orders  also  means  that, 
although  setting  up  and  inverting  all  the  R(m)  may  require  a  large  number  of 
operations,  only  a  fraction  of  those  actually  correspond  to  the  recovery  of  the 
Cn,,  of  any  given  m,  so  the  numerical  errors  due  to  rounding  or  truncation 
are  not  likely  to  accumulate  to  any  great  extent  in  the  results. 


2.12  Optimal  Formulas  for  non-Uniform,  Correlated  Noise 

Irregular  noise,  already  discussed  in  paragraph  (2.9),  may  be  due  not 
only  to  the  varying  quality  of  the  measurements,  but  also  to  the  way  the  data 
is  "grided",  L  e. ,  the  way  the  value  attributed  to  a  node  (or  block)  ij  is 
obtained  by  interpolation  from  actual  measurements  nearby,  as  usually  data  is 
not  sampled  regularly  on  a  global  basis.  As  the  number,  disposition,  and 
quality  of  the  measurements  used  will  vary  from  point  to  point  in  the  grid,  so 
will  the  accuracy  of  the  interpolated  values.  Furthermore,  even  if  the  measure¬ 
ments  themselves  are  not  correlated,  the  grided  values  may  be  correlated 
because  some  of  the  data  may  be  used  for  more  than  one  interpolated  value. 

This  brings  about  the  question  of  what  can  be  done  when  D  is  neither  diagonal, 
nor  are  the  Dlr  blocks  in  D  ,  corresponding  to  the  Cfr  blocks  in  Clz  ,  all 
Toeplitz  circulant.  The  answer  is  a  simple  extension  cf  the  results  already 
obtained  for  the  uncorrelated  case. 

When  the  noise  is  both  non-stationary  and  correlated,  replacing  the  co- 
variances  E  {njj  w, }  with 

?lrW  =  4S  (%  <E  fn”  )  +  E  {nn  “r  j*  }  )]  .  h  =  j  -  s, 

will  result  in  a  modified  "noise  matrix"  L  where  the  (corresponding  to 
the  D*  and  the  C*)  will  be  all  Toeplitz  circulant,  because  the  "covariance" 
CTSr,h|  satisfies  the  condition  that,  for  a  given  i  and  r  ,  it  is  a  function  of  |j-s| 
alone.  The  optimal  estimator  for  the  modified  measure 

«(L)=tr{C  -  2 Cat  FT  +  F(C„  +  L)  F^ 
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must  be  of  the  quadratures  type,  because  of  the  structure  of  L  .  To  show 
that  is  is  also  the  best  estimator  of  this  kind  in  terms  of  the  original  norm 


#{D)=  trf  C  -  2  Cei  FT  +  F (Ctl  +  D)  F^  , 
the  proof  will  proceed  much  as  in  the  case  of  paragraph  (2.9). 

The  propagated  error  measure  for  C31  is 

( N-1  3I+-1  N-l  a  M-l  /  \  1 

<&.  ■  E{|0  r*“{r4 


M_1  3N-1  N-l  3  N-l  (cos)  (cos) 

-r  Z  l  l  xr  xT  {.to}  mjAX  jsin) 


msAX  E 


1=0  j=Or=o*=0 
so,  for  CnB  and  combined. 


•1 

fn  tj  “r.J 


,a 


n-i  a  n-i  *~i  a  N-i 


Z  °rr" 

Of=0 


Z  Z  Z  Z*?*"  C0S E  (nU  nr*} 

1=0  J=0  r=0*=0 


thus 


N-i,  n  N-i  N-l  3N-1  a  N-l 

ijjlD)  =  tr  [FDFT]  =n£  Z  Z  L  xT  Z  L  co8m(3-s)AX 

N_X  B  N-l  N-l —  0  1  ~  N-l  i  -JN_1  ° 

“  Z  Y  Y  £x?*Xr“  f  cosmhAX  £  (E{nl3 

n=0«=01=or=0  h  =  °  J  =  0 

Replacing  both  E  (n13  n,  and  E  {n{)  0,^.1,}  in  the  last  expression  with 
aI^,ll,  is  the  same  as  replacing  D  with  L  ,  so 


W 


T  ff-i  a  N-X  M-l  „  N-l  3N-J 

$  (L)  =  tr(FLF]=r  V  Y  r  x"Xr  r  cosmhAX  y  2a 

n  ..0  .*0  1  =  0  r%0  l4o  it 0 

^  f  t  r  «.  -  r  »  »  .  ,  ^_tr  Ih 

=  y  y  y  y  xi  xr  Y  cosmhAX  4Na 

a  —  0  »  =  0  <*r0  r4r0  «o 

n-l  a  N-l  N-l  N-l  3  N-I  .  .  . 

-  f  f  r  FXiXr  r  cosmhAX  r  (E{dij  Vn*  }  +  E{nl3  n,^ |) 

a  to  u  =  0  or=o  1-0 

because  of  the  definition  of  Ir  lhl .  Comparing  the  expressions  for  ^(L) 
and  for  ?  (D)  it  is  clear  that  they  are  identical.  From  this  follows  that  the 
modified  error  measure  $(L)  coincides  with  $(D)  when  the  estimator  is  the 
optimal  estimator  of  the  quadratures  type  for  $(L),  and  that  this  must  be  tlie 

optimal  estimator  of  the  quadratures  type  for  $(D)  as  well.  The  other  con¬ 

clusions  arrived  at  in  paragraph  (2.9)  for  the  uncorrelated  case  apply  equally 
well  here. 


2.13  Least  Squares  Adjustment,  and  Least  Squares  Collocation 
(a)  Band-Limited  Signal 

If  there  is  a  degree  Nlt,  above  which  the  degree  variances  are  all 
negligible  or  zero,  then  the  signal  can  be  said  to  be  band  limited,  and  the  data 
will  satisfy  equations  of  the  type 
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(2.65) 


Nb*x  n  I  _ n  _o; 

m13  =  r  f  Y  CB“  yaa  (Mj>  +  nl3 

n  =  0  B=0^0 

(tlie  treatment  here  is  for  point  values;  the  extension  to  area  means  is  trivial) 
With  one  equation  such  as  (2.65)  per  point  in  the  grid,  the  result  is  a  system  of 
equations 

m  +  v  =  Ac  (v  =  -n)  (2.66) 

where  A  is  a  Np  xN,  matrix  (Np  is  the  number  of  points  in  the  grid, 

N0  the  number  of  coefficients).  The  columns  of  A  consist  of  elements  of  the 

type 


a"  =  Y^fr.Xi) 


(2.  67) 


According  to  the  discussion  in  paragraph  (1.3),  if  the  grid  is  equal 
angular,  A  has  full  rank  when  N,.x  *  N  .  N0  =  N  2  and 
the  upper  limit  in  the  summations  is  N  -  1  in  what  follows. 


Least  squares  adjustment  is  a  method  for  solving  for  the  while  mini¬ 
mizing  the  propagated  noise  defined  in  paragraph  (2.4).  The  least  squares 
solution  is 


c  =  (ATD-1A)_1  ATD_1  in 
=  G_1AT  Lf1  m 


(2.  68) 


where 

G  =  AtD-1A  (2.69) 

is  the  Nc  x  Nc  normal  matrix,  while  D  =  E{n  n}  is  the  same  noise  matrix 
considered  before.  Clearly,  the  least  squares  estimator  matrix  is 

F 1,  =  (AtD_1A)  ATD_1 

When  the  noise  has  zero  moan  (E  {«}  =  o  ),  the  estimator  of  (2. 68)is  the  best 
linear  unbiased  estimator,  because  it  minimizes 

tr  (E  {e  nnT  pT}  ]  =  tr  [PDF') 

and  E  {E  (z  +  n)}  =  c  .  If,  in  addition  to  all  this,  the  probability  distribution 
of  the  noise  is  Gaussian,  then  (2.  68)  corresponds  to  the  maximum  likelihood 
estimator  as  well.  In  many  scientific  applications  the  noise  has  approxinntely 
zero  mean  and  near-Gauss ian  distribution,  while  D  is  known  reasonably  well; 
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for  this  reason,  methods  based  on  expression  (2.68)  are  used  quite  often.  The 
linearity  of  the  resulting  estimators  is  helpful,  because  this  avoids  the  use  of 
methods  based  on  non-linear  formulas  that  are  usually  difficult  both  from  a 
theoretical  and  from  a  practical  point  of  view.  The  variances  of  the  estimates 
axe  given  by  the  corresponding  diagonal  elements  of  the  a  posteriori  variance- 
covariance  matrix 

E  -  (ATD"1A  f1  =  G"1 

It 

Therefore,  to  obtain  both  the  estimates  and  their  variances  it  is  necessary  to 
know  Cf1  .  Sometimes,  because  of  the  nature  of  A  and  D  ,  G  can  be  seriously 
ill-conditioned,  the  Inversion  suffering  from  strong  numerical  instabilities.  To 
reduce  this  problem, a  simple  device  known  as  regularization  is  often  used  (see, 
for  instance,  Tikhonov  and  Arsenin,  1977).  Generally  speaking,  regularization 
is  the  introduction  of  a  slight  change  in  a  problem,  so  the  solution  virtually 
rrmains  the  same,  but  the  modified  problem  has  better  numerical  properties. 

In  least  squares  methods  regularization  usually  implies  adding  a  small  positive 
definite  matrix  K(diagonal,  as  a  rule)  to  G  before  attempting  to  invert  it. 

The  regularized  optimal  estimator  would  be 

c  =  (ArD'xA  +  K)  ATD“lm  (2.70) 

The  inverse  of  the  covariance  matrix  of  the  harmonic  coefficients  C  is  a  pos¬ 
itive,  diagonal  matrix  which  could  be  used  to  regularize  the  normal  matrix: 

4  =  (AtD“1A  +  CT1)  ATD'1m  (2.71) 

It  is  easy  to  see  that  this  expression  minimizes  the  quadratic  form 

Q  =  cTC-1c  +  vTD-1v  (2.72) 

subject  to  the  constraint 

m  =  Ac  +  v  (2. 73) 

Moreover,  (2. 72)  is  the  equivalent  of  the  least  squares  collocation  error  measure 
when  then  signal  is  band-limited  (Moritz,  (1980)).  This  idea  has  been  used,  among 
others,  by  Schwarz  (1975)  for  the  determination  of  low  degree  zonal  coefficients 
of  the  geopotential,  and  by  Lerch  et  al.(1979),  who  employed  it  to  estabiiize  the 
adjustment  of  the  GEM-9  gravity  field  model  with  remarkable  success.  The 
equivalence  of  (2. 71)  to  the  collocation  estimator  is  true  only  for  band-limited 
signals;  to  the  "real  world"  the  gravity  field  has  infinite  bandwith,  so  (2. 71) 
is  no  more  than  an  approximation.  The  band-limited  assumption  is  a  reasonable 
one,  however,  as  the  eventually  become  negligible  for  large  n. .  This  is 
particularly  true  at  satellite  altitudes;  in  any  case,  geodesy  is  a  science  of 
wise  approximations.  Moritz  (1980)  has  provided  a  very  clear  and  concise 
explanation  of  the  use  of  collocation  in  general,  and  expression  (2. 71)  in  parti¬ 
cular,  in  spherical  harmonic  analysis. 
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An  alternative  derivation  of  (2. 71)  follows  from  the  matrix  equation 

CAt(ACAT  +  Df1  -  (At  D”1  A  +  C"1)"1  AtD_1  (2.74) 

(see,  for  instance,  Uotila  (1978),  equation  (29)),  which  is  valid  for  symmet¬ 
rical  matrices,  provided  the  inverses  or  pseudo  inverses  of  D  ,  C  ,  and 
(A  C  At  +  D)  do  exist.  According  to  the  definitions  in  paragraph  (2.2): 

C0*  *  m{cz’}  =  m{££.TaT}  =  m{££T}aT  =  caT  (2*75) 

C**  =  m{z  zT}  =  m{ac  cTAT}  =  Am{c  cT}  AT  =  ACAT  (2.76) 

Replacing  C0*  and  Ctt  in  the  expression  of  the  collocation  estimator  matrix 
(2, 39)  with  their  equivalents  given  by  (2. 75)  and  (2. 76): 

F  =  C„  (C„  +  Df1  =  CAT(ACAT  +  Df1  (2.77) 

=  (ATD_1A  +  C-1  )_1  AT  D_1 

according  to  equation  (2. 74).  This  shows  that  the  "regularized"  estimator  matrix 
(AtD-1  A  +  C-1)"1  AtD-1  Is  indeed  the  same  as  the  collocation  estimator 
matrix,  so  (2, 71)  represents  an  alternative  form  of  collocation  when  the  data 
is  band-limited. 

(b)  Infinite  Bandwlth 


In  this  case  the  "observation  equations"  are 


00 

mU  ■  I 

o  =  0 

a  I  fu 

l  E 

■=°  a=Q 

Y®  (8i  ,Xj)  +  n14 

Calling 

•  a  1  <y 

»•,  *  E  E  E 

a=H*~L  •a=0  q;  =  0 

7f.(0i,X4) 

(2. 78a) 

and 

W  -  [wx  . 

.  .  w^  .  .  .  wNp]T  (k  »  2N  1+  j) 

(2. 78b) 

then 

N 

mU  •  I 

»  J-  -a 

[  [c. 

(0i  f^i)  +  3  +  nu 

(2.79) 

a  =0  iso  0 


and,  regarding  this  expression  as  a  modified  observation  equation,  and 
replacing  D  with  D  +  M|wwTj  in  (2. 71),  the  linear  estimator  that  mini¬ 
mizes  the  quadratic  form 


(2. 80) 


£  =  (A T(D  +  m{w  wT}  fl  A  +  C-1  l"1  AT(D  +  M|w  wT}  f1  m 

It  is  easy  to  show,  either  following  the  lines  of  Moritz  (1972),  or  going 
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(2.81) 


back  once  more  to  the  matrix  identify.  74),  that  expression  (2.81)  is  identical 
with  the  estimator  of  least  squares  collocation.  Rigorously  speaking,  expres¬ 
sion  (2.81)  should  be  used  whenever  oj  /  0  for  n  >  N  ,  even  if  <fn  =  0  for 
n  >  Nm  x  t  for  some  finite  N#x  >  N  . 


2.14  Ridge  Regression  and  Least  Squares  Collocation 
Consider  once  more  the  estimator 
c  -  Fj^s  m  =  (ATD_1  A)-1  A1!)'1  m 
If  there  is  no  noise,  so  m  =  z_  =  A  £  ,  and  if  n  <  N  ,  then 
c  =  (ATD-1A)-1  AtD_1A  c  =  c 

According  to  the  definition  given  in  paragraph  (2.4),  the  sampling  error  of 
this  estimator  is  zero,  so  the  measure  of  this  error  must  be  also  zero.  If. 
the  noise  has  zero  mean,  it  follows  that 

E{6}  =■  e{Fj1.  m}  -  F(.  e{a  c  +  »}  -  F,.  A  c  +  E{n}  =  c 

or,  as  it  is  usually  said,  the  estimator  is  unbiased.  Moreover,  by  a  simple 
extension  of  the  Gauss-Markov  theorem  to  the  case  of  a  general  symmetrical 
positive  matrix  D  (see,  for  instance,  Bibby  and  Toutenberg,  1977),  F^s  m, 
of  all  linear  imbiasecLestlmators,  has  the  least  propagated  error  measure 
trjFjj,  DFT£«J  =  tr  -  (ATD-1A)j  ,  as  mentioned  previously. 

The  estimator  of  expression  (2.77)  does  not,  in  general,  give  perfect 
estimates  of  c  in  the  absence  of  noise:  it  is  a  biased  estimator,  and  the 
measure  of  the  bias  is  tr  |c  -  2C„IFT  +  FC,  rFT  j  (thisternf’ean  no  longer 
be  regarded  as  the  measure  of  the  sampling  error,  as  it  is  the  presence  of 
CT1  inside  the  parenthesis  in  (2.71)  and  not  the  sampling  that  brings  about 
this  error).  According  to  (2.39),  F  is  the  estimator  that  minimizes  the 
total  error  measure,  so 

af  =  tr  {c-2CSJF+  F(C„  +  D)  FT} 

£  tr  {c  -  2C,2  FTx,  +  F*.  (C„  +  D)  Fj.} 

*  “  K-dfV] 

If  the  covariance  matrix  is  positive  definite  (which  only  requires  that  all 
/  0  for  0  <  n  r  N)  then  the  last  expression  applies  with' strict  inequality 

tr  {c  -  2C.,  FT  +  F(CJ2  +  D)  FT}  <tr{F/,  Dl’J,} 

-  tr {  (ATD_1  A)-1} 
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In  the  band -limited  case. 


Some  may  find  this  result  rather  surprising:  the  best  estimator  with  zero  bias 
is  in  fact  worse  than  the  biased  estimator  of  (2.  71)  !  The  difficulty  is  only 
apparent:  F^.  is  the  best  estimator  with  no  bias;  once  the  "no  bias”  con¬ 
dition  is  removed,  the  expression  above  merely  indicates  that  there  is  an 
estimator  in  the  larger  class  of  the  estimators  that  have  a  bias  (including 
those  with  zero  bias)  such  that  the  sum  of  the  bias  and  the  propagated  noise 
is  smaller.  From  this,  it  is  clear  that 

tr  {FDFT|  <  tr  -^(AT  D~ 1  A  )|  (as  tr  Ic  -  2C«  FT  +  FC„  FT  j  &  0  ) 

which  is  indeed  possible  when  the  condition  C  -  2  C0ZFT  +  F  C„  FT  =0 
is  removed.  In  fact,  there  is  nothing  very  new  about  all  this:  the  use  of 
biased  estimators  to  obtain  estimates  with  small  variances  is  a  reasonably 
well-established  practice  in  applied  statistics.  In  particular,  the  technique 
known  as  ridge  regression  consists  in  using  the  biased  estimator 

c  =  (xx  +  K)"1  xT  m 

with  a  suitable  choice  of  K  (Bibby,  1972).  Clearly,  this  expression  is  the 
same  as  (2.  71)  when  x  =  A,  D=I,  and  K  =  C-1.  With  some  obvious  modifi¬ 
cations  suggested  by  (2.81),  this  argument  can  be  extended  to  the  estima¬ 
tion  of  c  when  n  s  N  ,  so  it  can  be  said  that  within  the  scope  of  spherical 
harmonic  analysis  least  squares  collocation  is  a  form  of  ridge  regression. 

This  brings  up  the  question  of  just  how  realistic  the  error  measure 
is;  after  all  the  best  of  all  possible  estimators  in  terms  of  a  given  norm 
could  be  a  very  had  one  for  some  specific  problem  where  that  norm  is  not 
suitable.  To  answer  this  question,  one  must  start  by  defining  the  meaning  of 
"realistic".  If  one  is  interested  in  minimizing  the  actual  error  variance  of 
the  coefficients  per  degree,  i.e. ,  the  expression 

6  »  =  E  t  A  -  Cf.  f  (2n  +  If1 
a=  o  i=o 

which  may  be  of  interest  because  this  corresponds  to,  say,  the  global  nmefin 
square  of  the  error  of  representing  thp  continuous  function  (9,A)=  £ 

^nn  Tn,(0,A)  with  ^w^t«(®>^),  according  ^  v 

to  Parseval's  identity  (exp.  (1.12)),  then  one  could  say  that  a  realistic 
measure  is  one  that  gives  close  estimates  of  the  actual  crrorvariances. 

The  "actual  error"  measure  6®  corresponds  to  one  of  infinitely  many 
’fevents"  over  which  the  collocation  measure  is  an  average.  The  proof  of  a 
pudding  being  in  the  eating,  the  reader  can  judge  just  how  realistic  the  collo¬ 
cation  measure  is  by  looking  at  the  numerical  results  in  section  3,  where  the 


value  of  the  measure  turns  out  to  differ  only  by  a  small  percentage  from  the 
actual  variance  in  each  one  of  a  number  of  simulated  ’’events",  j.e. ,  the 
recovery  of  the  Cj^  fro m"simula ted  data','  where  the  q®  are  known  random  num 
bers  scaled  to  have  the  desired  power  spectrum. 


2.15  Structure  of  the  Normal  Matrix 


The  elements  of  the  normal  matrix  G  (in  the  case  of  point  data)  are 
of  the  form 

N-l  3  N-l 

Snn, (pi  Z  ^p<i 

N-l  a N-i  ^  .  (2. 82] 

=  Z  P»p(cos9i)  ppq(cos0i)  Z  \sin  }  mJAX  ■isi>nlqjAXCT"iS 

where  for  all  0  <j  ^  2N  (i.e.,  "regular"  noise  as  defined  in 

paragraph  2.8).  If  n  ,  m  <  N  ,  then  the  following  equations  apply: 

“"l  {sto}mJiX{s^S}<5JaX=  0  “  <2'6S 

J  —  o  rn  f  q 

Moreover,  if  the  grid  is  symmetrical  with  respect  to  the  Equator  (as  equal 
angular  grids  are),  and  if  of  =  erf-j-,  (for  instance,  if  is  indepen¬ 
dent  also  of  i  )  the  relationships 

of  Pn,  (cos©,)  =  PnB(cosel) 

°i3  (cos0i)=pPa  (cos  6\)  (-lf'cr^ti 

9i  =  77- et 

must  apply,  according  to  par.  (1.2),  and  from  these  follows 

N-l  _ 

^  erf3  (cosSt)  PpB^osSj)  =0  if  n  -  p  is  odd.  (2.84 

In  brief:  if  n  ,  m  <  N  ,  and  the  grid  is  regularly  spaced  in  longitude,  and 
symmetrical  with  respect  to  the  equator,  then 

{«  f  0,  m  /  q 


«  0  if 


(2.85 


n  -  p  is  odd 


If  neither  of  the  conditions  listed  above  apply,  then  g,,,’^,  may  or  may  not 
be  z.ero.  If  the  coefficients  C^,  are  ordered  in  c  so  that  all  those  of  the 
same  m  are  grouped  together,  and  for  a  given  m  all  "cj,  are  separated 
from  all  C«B  *  and  further  more  all  C®,  with  n  -  m  even  are  separated  from 
all  those  with  n  -  m  odd,  then  the  normal  matrix  becomes  arranged  in  such  a 
way  that  all  potentially  non -zero  elements  (i.e.,  not  satisfying  (2.  85))  are  also 

A 

grouped  together  forming  a  series  of  diagonal  blocks  G„  ,  where  6  signals 


the  parity  of  n-p.  Each  one  of  these  diagonal  blocks  is  made 
exclusively  of  one  of  the  following  types  of  elements 


n~P  even  (6=0) 

.  n-p  odd  (6=  l) 


If  the  .grid  is  not  symmetrical  with  respect  to  the  equator,  groups  of  type 

Jand  \§=ij  ,  or  IqJ  and  become  included  into  larger  non¬ 

zero  blocks,  as  there  are  more  non-zero  elements  in  that  case.  However, 
here  the  discussion  will  cover  only  the  symmetrical  case. 


The  largest  blocks  are  Go'^ ,  and  their  dimension  is  N2; 
the  smallest  blocks  have  dimension  lxl,  for  example  . 


The  inverse  of  any  block-diagonal  matrix  suoh  as  G  is  another  block- 
diagonal  matrix  made  up  of  the  inverse  of  the  blocks  of  G  .  There  are 
4  N  -  2  diagonal  blocks  G®’  in  G  ,  and  as  many  in  G-*1. 


The  eigenvalues  of  G  are  those  of  the  diagonal  blocks,  and  the 
eigenvectors  of  G  ,  those  of  the  same  blocks  "expanded"  with  zeroes  at  both 
ends,  so  as  to  reach  the  dimension  Np  of  G  . 


The  estimates'  vector 


c  =  G-1  AT  D"1  m 


A  6 

can  be  partitioned  in  the  same  way  as  £  ,  each  partition  c®'  including  all 
coefficients'  estimates  of  the  same  m  ,  a,  and  parity  6  of  n  -  m  ; 


aO,  6 
£a 


(G^V1 


D-1  m 


(2. 86) 


where  A„  isa(N  -  m)  x  Np  matrix  with  rows  that  are  Np  -  vectors 
of  the  type 


,  -  «  rcos-j 

£»,n  =  1  •  •  •  Pro,  (cos 0j  )  |gin  j  mjAA.  .  .  )  (2.87) 

(n  -  m  even  if  6=0,  odd  if  6  =  1  )  So  the  rows  of  (G^’  )_1  A®'^  are 
linear  combinations  of  vectors  of  the  same  frequency  m  and,  therefore,  also 
vectors  of  the  same  frequency: 


=  [  •  .  •  Xi'jfe&f}  mj  AA .  .  .  ]  (2.88) 

f V 

Consequently,  the  estimate  of  a  giver  CnB  is 

6“  =  h®  D_1  m  =  T  f  Xi"  {shi  } m j  AX  of  mjj  (2.89) 

i  =  o  3  =  0 

and  this  is  a  quadratures  type  estimator. 
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Setting-up  and  inverting  the  G,  blocks  is  tantamount  to  setting  up 
and  invertingthe  whole  matrix  G  .  Since  all  operations  related  to  one  of  the 
blocks  are  independent  from  those  for  the  others,  the  inversion  of  G  i|^  g 
ideally  suited  for  parallel-processing  computing.  On  average,  each  Ga ' 
requires  0(N3)  operations  to  invert,  or  0(N4)  altogether.  Inverting  G 
by  ordinary  techniques  would  involve  0(/(N  )3  )3)  =»  0(N6)  ,  so  there  is  an 
increase  in  efficiency  of  0(N2)  . 


Finally,  it  is  quite  simple  to  show  that  these  properties  carry  over  both 
to  the  case  of  area  means,  and  to  problems  where  the  surface  being  studied  is 
not  a  sphere,  but  a  surface  of  revolution  symmetrical  about  a  plane  perpendi¬ 
cular  to  the  axis  of  rotation,  provided  that  the  longitude  increments  be  constant 
and  the  grid  symmetrical  with  respect  to  the  "equator".  In  this  lattercase, 
the  expansion  of  the  signal  in  solid  spherical  harmonics  is 


i] 


N-l 


a  v— i 
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a=o 
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-snr 
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and  the  factors  r^i  are  symmetrica]  about  the  equator,  from  which  all  the 
properties  already  mentioned  for  G  follow. 


Clearly,  the  structure  of  G  possesses  many  properties  similar  or 
identical  to  those  of  (Czl  +  D)  when  the  data  is  regularly  sampled  on  a  sur¬ 
face  of  revolution.  These  similitudes  underline  the  intimate  relationship 
between  least  squares  adjustment  and  least  squares  collocation  shown  in  the 
preceeding  paragraph.  In  feet,  as  least  squares,  regularized  least  squares, 
and  collocation  differ  only  in  the  diagonal  matrix  (  K  ,  orC"1)  being  added 
to  G  In  expressions  (2.68),  (2.70),  and  (2.  77),  all  the  properties  mentioned 
here  for  G  apply  to  the  normal  matrices  in  each  of  the  three  methods  equally 
well.  The  one  important  consideration,  in  the  case  of  collocation,  is  that  t£e 
data  be  band-limited.  Otherwise,  expression  (2. 81)  indicates  that  (C  +mWm 
and  not  ~cT*  muitbe  added  to  G  .  Matrix  C  +  w  wT  \  has  the  same 
Toeplltz-type  structure  of  (CIX  +  D)  discussed  in  paragrkph  (2:!  10).  Therefore, 
creating  and  inverting  the  normal  matrix  requires:  (a)  creating  and  inverting 
C  +  M-fw  wr|,  and  (b)  creating  and  inverting  AT  D-1  A  +  (M  jw  v/'l-  +  c )_: , 
which  can  be  shown  to  have  the  same  block  diagonal  structure  discussed  here. 
This  is  twice  the  work  needed  to  set  up  and  Invert  (Ct  t  +  D)  using  the  approach 
of  paragraph  (2.11),  so,  in  the  case  of  infinite  bandwith,  that  approach  is  more 
economical  in  computing  and,  therefore,  more  practical. 


There  may  be  one  important  point  in  favor  of  using  formula  (2.  77) 
or  (2.  81)  rather  than  formula  (2. 39)  for  obtaining  the  optimal  estimator 
matrix  F  at  least  in  the  band-limited  case:  as  the  density  of  the  grid  in¬ 
creases,  matrix  (C*  +  D)  becomes  increasingly  more  ill-conditioned,  because 
the  closer  distance  between  data  points  results  in  covariances  that  have  much 
the  same  values  in  consecutive  rows  or  columns.  On  the  other  hand,  the  non-zero 
diagonal  blocks  in  G  are  likely  to  become  more  and  more  diagonal -dominant 
as  ,  AX  0  .  This  will  depend  on  D  :  for  instance,  if  the  variances  of  the 
noise  were  of  the  form 
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then 


ofj  =  sin  8t 


Li,  . 

A 9  AX  *a_6_ 

N-l 

=  L„  £ 

A0,  AX 

4tt 

A9,  AX  1=0 

-0  -0 

=  -1_ 

f  Y?.(e,X) 

-0  -0 

(e,X)  do  = 

4tt 

Jo 

T&Oi.Xj)  Y^(6ltX})  sin6,  A0AX 
3  =  o  4tt 

0  if  n  t  p 
1  if  n  =  p 


because  of  the  orthogonality  relationships.  In  general,  the  variances  of  the 
noise  are  not  going  to  follow  a  sinusoidal  law,  but  one  may  reasonably  expect 
(at  least  with  more  or  less  homogeneous  noise)  that  the  stability  of  the  normal 
equations  will  not  deteriorate  with  A  6,  A  X  -  0  . 


2.16  Global  Adjustment  and  Collocation  with  Scattered  Data 

The  efficient  set  up  and  inversion  of  the  covariance  matrix  (C„  +  D), 
or  of  the  normal  matrix  G  ,  depend  on  the  regular  nature  of  the  grid.  If 
not  all  nodes  or  blocks  in  the  grid  have  data  associated  with  them,  the  data  is 
said  to  be  scattered.  The  blanks  or  "holes"  in  the  grid  destroy  the  orderly 
structure  of  the  matrices,  making  the  application  of  the  techniques  previously 
discussed  impossible.  Yet  so  strong  is  this  structure  that,  even  in  fragments, 
still  it  can  be  dealt  with  more  efficiently  than  in  the  case  of  ordinary  matrices 
of  the  same  size. 

(a)  Full  Region  Bound  by  Lines  of  Latitude  and  Longitude 

In  the  case  when  there  is  data  at  every  point  or  block  inside  a  "square" 
region  limited  by  parallels  and  meridians,  the  partitioning  of  the  data  vector 
along  the  arcs  or  parallels  inside  the  zone  reveals  a  strong  structure  in  the 
(C, ,  +  D)  matrix,  if  all  the  other  assumptions  made  in  paragraph  (2. 10)  still 
apply. 

If  Nr  is  the  number  of  rows  and  N,  the  number  of  meridians  that 
cross  the  region,  then  the  covariance  matrix  will  consist  of  N?  blocks  Clp  of 
dimension  N«  ,  both  per  symmetrical  and  Toeplitz,  though  not  circulant 
(i.e.,  the  relationship  =  c£1<l+1  is  fulfilled,  but  not  c0*  =  cN^q_i); 
moreover  the  first  row  in  each  block  does  not  have  the  property  that 
Cq,  =c0£0_,  .  clearly,  though  weaker  than  in  the  case  of  a  global  grid, 
there  is  a  definite  structure  here  that  can  be  exploited  to  make  both  setting 
up  and  Inverting  the  matrix  more  efficient. 

Because  each  block  C  *  is  Toeplitz,  only  the  N0  elements  in  its 
first  row  have  to  be  computed,  or  about  £n«  Nr  for  the  matrix  as  a 
whole,  instead  of  ^N,aNr3  ;  this  amounts  to  a  reduction  in  operations  by 
a  factor  of  N,  . 
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The  solution  of  the  equation  fn,  =  (C«  +  D)"1  c^attiZ  ,  for  the  optimal 
estimator  vector  f  for  q,a  ,  can  be  obtained  by  a  technique  such  as 
conjugate  gradients,  or  similar,  in  which  a  finite  number  K  of  matrix- 
vector  multiplications  (C„  +  D)vt  (where  v 0  ,  Vj  ,  ...  vt  ....  v..  are 
K  intermediate  NP  -  vectors  created  during  the  solution)  constitute”  the  bulk 
of  the  computing  effort.  There  is  no  need  to  go  into  the  details  of  any  specific- 
technique,  as  the  reader  will  find  excellent  descriptions  in  the  literature 
(Householder,  1964,  Luenberger,  I960).  A  discussion  of  the  matrix-vector 
operation  is  sufficient  here. 


Let  m1  be  the  Nc-  vector  partition  of  the  N*  N.data  vector  m  , 
containing  the  measurements  along  the  ith  parallel  in  the  region,  and  let  v/ 
be  the  corresponding  partition  in  any  of  the  vt  vectors.  The  product  (CZ2  +  D)vt 
=  pt  is,  under  such  partition, 

r  O  N  r~b. T 

£t  _  [£t  •  •  •  £t  1 


with 


Nr_l 


(2..  90) 


so  the  whole  matrix-vector  multiplication  can  be  broken  up  into  NP  products 
CIpv^  =  hlp  .  Because  Ci?  is  a  Toeplitz  matrix,  the  N0  components  of 
htlp  can  be  obtained  by  "weighted  running  averages"  or  discrete  convolution 
of  the  elements  of  vf  with  those  in  the  first  row  of  Such  convolution 

can  be  calculated  efficiently  using  the  Fast  Fourier  Transform  algorithm  (see, 
for  instance,  Brigham,  1974,  Ch.  13).  Therefore,  all  Nf  products  involve 
0(NC  N^)  operations,  and  since  there  are  K  matrix-vector  multiplications 
in  the  whole  procedure,  the  total  number  of  operations  needed  to  obtain  i  ® 
amounts  to  0(KNc  N,2).  For  conjugate  gradients,  K  does  net  exceed  (in 
theory)  Np=NrN0  ,  so  there  should  be  0(N?  N3)  operations  altogether. 

If  (Cz 2  +  D)  were  handled  by  conventional  techniques,  disregarding  its  well 
defined  structure,  the  number  would  be  0(N„3  N, )  ,  so  tlie  increase  in  ef¬ 
ficiency  is  0(N„)  ,  the  same  as  for  the  setting  up. 


(b)  Arbitrarily  Scattered  Data 


It  is  common  in  geodesy  and  in  geophysics  to  have  a  set  of  measurements 
scattered  throughout  the  globe,  without  the  data  being  on  the  nodes  of  a  reg- 
ular  grid  or  without  all  o"j  j  being  equal  along  parallels  (nonhomogenous  noise). 
If  the  set  is  dense  enough,  however,  it  is  possible  to  interpolate  the  data  quite 
reliably  on  the  closest  nodes  of  a  conveniently  chosen  grid.  Assuming  that  this 
is  done,  and  that  the  accuracies  of  the  interpolated  values  are  known  well 
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enough,  then  the  problem  can  be  dealt  with  by  conventional  least  squares  or 
by  collocation.  In  general,  there  will  be  blanks  or  "holes"  irregularly  dis¬ 
tributed  over  the  sphere,  the  actuid  data  points  falling  among  them  in  no 
precise  pattern.  This  problem  will  be  considered  hero  as  a  least  squares 
adjustment  problem.  If  the  data  has  little  or  no  power  above  the  Nyqulst 
frequency  of  the  grid  on  which  it  has  been  interpolated,  then  the  extension  of 
the  ideas  that  follow  to  collocation  is  quite  simple,  according  to  paragraph 
(2.13). 


Consider  the  element  g“^4  of  the  matrix  G  -  AT  D  1 A 


= 

bn  m J>Q 


£  pns  (cose,)  Pp4  (cos  0, )  £ 
1=0  1=0 


|  sin  |  {sin  }  ff*l 

(2.91) 


wtere  Wjj  =  0  if  the  point  ij  is  blank,  otherwise  VV13  =  1  ;  ofj  is  the  var¬ 
iance  of  the  noise  n,3  .  In  general  (Ttj  is  a  function  both  of  i  and  of  j  . 
Clearly,  matrix  D  is  taken  to  be  diagonal  (i.e. ,  uncorrelated  noise) . 


From  the  relationships 


cos  m  j  AX  cospjAX  =  i  [cos  (m  +  p)  j  A  X  +  cos  (m  -  p)  j  A  X] 
cosmjAX  sinpjAX  =  i[sin(m  +  p)  j  A  X -sin(m  -  p)  j  AX  ] 
sinmjAX  sinpjAX  =  -|  [cos(m  +  p)  i  AX- cos(m  -  p)  j  A  X] 


and  calling 


SN-l 


Cr01  ■ 


cos  r  jAXa„  W1: 


BN-l 


Cr1=  2  ^  sin  r  j  A  Xa,3  W,j 
where  -N  <r<2N  ,  follows 


2N-1 

i  ^-o 


»?,  W„«  n*U  ffilpiW  "  [cjV* 

(3) 


(O) 

Moreover,  because 


cos  (-r)  j  A  X  =  cos  r  j  AX 

sin(-r)jAX=  -sinrjAX 

cos  r  j  A  X  =  cos  (2N  -  r)  j  A  X  (2nA  X=  2u) 

sinrjAX  =  -sin(2N  -  r)  j  AX 


(2.92) 


(2.93) 


follows 


o*?  - 

c,ia 

(-1  f 

varr  “ 

Cr® 

(-1  f 
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where  the  last  one  is  of  special  interest  when  r  s  n  .  So  only  the  C,‘ a 
with  0  <  r  <  N  are  needed.  Finally,  calling 

=  Pa»(cos  0,)  (COS0J)  [CL?"P|  +  (-D  aa*P  cftf*  )  (2.94) 

(where  jot-  /3|  is  the  absolute  value  of  a  -  £) 


it  is 


&  a», pq 


=  E  sift 


1  =  0 


(2.95) 


Assume  that  out  of  N  rows  the  grid  has  only  I  with  any  data  in  them,  so 
at  least  one  W{1  ^  0  in  each.  Once  the  corresponding  Crl£*  have  been  ob¬ 
tained  by  computing  the  discrete  Fourier  transform  of  o^3  Wtj  along  each 
row  with  data  ^(N3)  operations  for  the  whole  grid),  what  remains  is  to  get 
the  I  non-zero  terms  gat^f  that  form  each  element  gffi{?a  of  G  .  As  there 
are  about  ^N4  such  elements  that  are  different,  and  I  is  O(N)  (except  for 
very  sparse  data  sets),  the  total  number  of  operations  needed  to  create  G 
is  OfN3)  +  0(N&),  or  virtually  0(N6)  .  If  the  g^,  were  computed  according 
to  (2.91)  as  it  is  written,  instead  of  according  to  its  reduced  version.  (2.95), 
the  number  of  operations  would  be  0(2N3  (N)4  )  ,  or  0(Na  ),  so  the  gain  in 
efficiency  allowed  by  this  approach  is  O(N)  ,  which  is  the  same  as  in  the  case 
studied  in  the  first  part  of  this  paragraph,  where  the  data  completely  filled 
a  "square"  sector  of  the  sphere.  This  count  does  not  include  the  time  needed 
to  obtain  the  (cos  9t ),  as  these  can  be  pre -computed  once  and  kept  on 
disk  or  tape  for  repeated  use. 


The  number  of  operations  can  be  reduced  further,  almost  by  half,  by 
takira  advantage  of  the  fact  that  Fa  ,(cos  £  i )  is  a  common  factor  in  all 
gn£pq  the  same  n  and  the  same  m  .  Furthermore,  if  there  are  pairs 

of  rows  i  and  NH. -1  (l.e.,  symmetrical  with  respect  to  the  Equator)  where 
both  rows  contain  some  data,  then 

Safcpq  +  &na,P9  ~  'Pat  (COS  9j)  Ppg  (COS  0  j) 

iiciir'*1* (-i)”^0"01)  ♦  * (-1)“  ,J 

(2.96) 


which  leads  to  further  savings  in  computing  at  fhe  cost  of  additional  program¬ 
ming  complexity.  These  economies  are  important,  but  they  will  not  bring  the 
number  of  operations  much  below  0(NS)  unless  the  grid  is  so  sparse  that 
I  is  much  smaller  than  N  . 

Notice  that  the  normal  matrix  G  =  AT  D_1  A  is  created  here  without 
actually  forming  the  observation  equations  matrix  A.  This  means  consid¬ 
erable  savings  in  computing  and  in  storage  requirements.  As  for  the  right 
hand  side  of  the  normals  AT  D"1  m=  b  ,  the  elements  of  b  are  given  by  the 
formula 
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(2.97) 


r 


N-l  3N-1  (-  i 

b*  =  I  I  3>.  (cos 9t )  mjAX  aTj  Wn  m„ 

where  k=n3+an  +  m  +  l  and  Wt  j  =  0  if  there  is  no  data  at  the  point  lj  , 
as  before.  Such  expression  is  of  the  quadratures  type,  and  can  be  computed  ef¬ 
ficiently  by  the  corresponding  algorithm  of  section  1,  also  without  first  creating 
A  .  Finally,  the  residuals  vector  v  =  m  -  A  A  «  usually  of  interest,  can  be 
obtained  as  the  difference  between  the  data  m  and  the  values  of 

I  t  Pn.  (cos9t)  {g^}  mjAX 

Ct  =  Oa  =  o  »“o 

computed  by  means  of  the  appropriate  synthesis  procedure  given  in  section  1. 
Thus  v  can  be  found  without  knowing  A  explicitly. 


The  normal  equations  can  be  solved  by  means  of  conjugate  gradients 
or  a  similar  method  involving  M  matrix-vector  products  xt  -  Ght  ,  where 
ht  is  part  of  a  sequence  of  Intermediate  vectors  h0  ,  hi  .  ht  ,  .  .  .  h,  . 

Introducing  the  notation 

a 

fea«t  =  ikt 


where  k=na  +  an  +  m  +  l,  and  calling  G  1  to  the  matrix  of  all  g1,®^  , 

N-i  .  N— 1 

SO 

L.  — ’  L.  ~  —  ‘  4.  —  ‘ 

1=0  1=0  1=0 

of  the  form 


(2.98) 


then  each 

*n«t 


N-l  ,  ,  N-l  N— i  , 

G  =  [  Gl  and  xl  *  [  G‘ht  =  [  x\ 

1=0  1=0  1=0 
ch  element  of  the  product  vector  is 

s;  -  f 1  e  t  -  *.<•»•»,>  j-  [C‘.Lrv 

P=0  ,  =  0  (J  =  o  Pq0 


Pq0 

.  Pp  a  (cos  Si)  h^,  t 


(2.99) 


Every  h^t  above  multiplies  always  the  same  Pp,  (cos8t ) ;  calling 

Pp«  (c°®  Si  )  h^t 


expression  (2. 99)  becomes 

=  pb.  (cosej  l  *+<£*}  4? 

P  qjS 

All  quantities  inside  the  square  brackets  are  constant  if  oc,  fi,  m  ,  and  q 
are  the  same.  Grouping  equal  factors  together 

=  P».  (cos  90  £  ict*+  (-l)3a+a  pTo  dpf 

*  p„  (cos  e, )  £ 1  £  ct*  of  (  ] 

where  D*  -  f  ^ 

p  =  o 
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There  are  (N3)  products  needed  to  form  all  d^;  (N2)  sums  to  compute  the 
2N  ,  and  (N)2  further  multiplications  bv  the  Fn„  (cos  0,)  to  form  all 

As  there  are  I  non-zero  G1  ,  there  arc,  per  matrix-vector  product, 
IxC^N3)  operations,  or  0(Na)  if  the  data  is  not  too  sparse.  M  ,  the  number  of 
matrix-vector  products  in  the  solution,  is  0(N ,  so  the  total  comes  to  0(Nb)  . 
Inverting  G  by  the  usual  methods  and  without  having  regard  for  its  structure 
involves  OfN6)  operations.  The  gain  inefficiency  is,  again,  O(N)  . 

Besides  providing  a  convenient  way  of  demonstrating  how  the  properties 
of  G  can  be  exploited  to  make  its  inversion  more  efficient  (or  the  solution 
of  the  normal  equations,  both  approaches  are  equivalent),  conjugate  gradients 
is  interesting  on  its  own  right.  Sparse  data  sets  with  a  poor  distribution  will 
result  in  ill-conditioned  normals,  so  the  inversion  of  G  may  be  numerically 
impossible.  So-called  iterative  methods,  such  as  conjugate  gradients,  usually 
improve  the  initial  guess  (represented  by  hp)  of  the  correct  values  of  the  un¬ 
knowns,  at  least  for  the  first  few  iterations.  Improvement  here  means  a  reduc¬ 
tion  in  the  quadratic  form  being  minimized,  such  as  the  mean  square  value  of 
the  residuals.  If  the  initial  guess  is  a  gcod  one,  and  present  day  spherical 
harmonic  coefficients  of  the  gravity  field  are  reasonable  good  for  degrees  up  to 
30  or  so,  then  a  few  iterations  are  likely  to  improve  this  guess, and  to  produce 
reasonable  estimates  of  those  coefficients  that, being  wholly  unknown,  are 
taken  to  be  zero  at  the  start.  If  a  "few”  iterations  are  much  less  than  the  max¬ 
imum  N3  ,  then  a  reduction  in  computing  time  of  0(Na)  takes  place.  This 
might  allow  scientists  to  "extend"  existing  models  to  much  higher  degree  end 
order  than  at  present,  simply  by  obtaining  approximate  solutions  of  this  type . 

Clearly,  all  that  has  been  said  here  regarding  G  and  least  squares 
adjustment  applies  equally  well  to  (G  +  C  *)  and  least  squares  collocation. 
Though  the  formulas  have  been  developed  on  the  basis  of  a  point  values' 
formulation,  their  extension  to  area  means  is  not  difficult. 
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While  the  descriptions  of  the  methods  for  estimating  the  Sf, 
discussed  here  have  been  confined  to  the  case  where  all  the  data  are  of  one 
kind  (i.e.,  gravity  anomalies  only,  or  magnetic  anomalies  only,  or  geopo- 
tentlal  numbers  •  etc.)  their  extension  to  mixed  data  sets  is  immediate, 
provided  that  all  values  are  globally  distributed  according  to  grids  of  the  same 
A  X ,  although  the  latitudes  need  not  be  the  same  as  well. 

Note  on  the  Accuracies  of  the  when  Using  Conjugate  Gradients; 

Besides  the  estimates  of  the  coefficients,  one  usually  wants  to  know  the 
accuracies  of  those  estimates.  When  a  few  iterations  of  conjugate  gradients 
are  used  to  update  some  initi al  estimates  in  the  efficient  way  described  above, 
what  are  the  accuracies  of  the  improved  coefficients?  In  the  course  of  the 
conjugate  gradients  procedure  (see  references),  the  conjugate  directions  v* 
of  G*  (At  D"1  A  +  C-1  +  Q71)  are  generated  (Ql1  is  the  variance  covariance 
matr  ixof  the  initial  estimates,  as  explained  in  paragraph  (2. 18)  part  (b)), 
together  with  the  scalars  v*  Gvn  =  ak  .  The  conjugate  directions  have  the 
property 


v  Ji  G  Vp  =  0 


for  k  ^  p  .  The  estimator  Implied  by  R  Iterations  of  this  procedure  is 
c  *  C?(AT  D"1  m  +  Q?c. )  =  G  AT  D"1  (z  +  n)  +  G  Q71  (c  +  Ac. ) 


where  c ,  is  the  vector  of  Initial  estimates,  Ac.  the  errors  in  this  vector,  and 

K  r 

r*  f  _  1  T 

G  =  2_  «ic  Vk  v„ 

k  -1 

The  variance-covariance  matrix  of  the  updated  errors,  for  ordinary  least 
squares,  is  (assuming  that  E  { n  Ac ,  |  =  0  ) 

E  j(G  AT  D_1  n)  (G  AT  D'1  n)T]+  E^(G  Q^Ac.)  (GQ-.1  Ac,  )T  ]•  =  GAT  D"1  D  D"1  A  G 

+  GQllQ.  Q71  <3  =  G  (At  D'1  A  +  Q',1)  3  =  GGG 

R  T  R  t 

=  r  a;  v,  ork  vk  afk1  =  V  ak  vt^  =G 

vt  =  l  k=l 


Therefore,  the  variances  of  the  errors  In  the 
ding  diagonal  elements  of  G  : 


* 


*  =  1 


3 

vk.  b«  a 


are  equal  to  the  correspon- 


where  v„jB,a  is  the  element  of  v  corresponding  to  in  d  (v  and  d 
have  the  same  dimension).  The  same  result  applies  to  least  squares  col¬ 
location.  All  that  is  needed  to  obtain  the  o^a  ,  according  to  the  formula 
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above,  is  knowledge  of  the  a.  k  and  of  the  v  *  ,  which  can  be  saved  as  they  are 
created,  during  the  kth  iteration  of  the  procedure.  For  the  efficient  algorithm 
to  be  applicable,  Q"1  must  have  a  suitable  structure.  One  case  is  when 
Ql1  is  diagonal,  or  can  be  satisfactorily  approximated  by  a  diagonal  matrix. 


2.17  The  Error  Matrix  in  the  Band-Limited  Case 


From  (2.41)  and  (2.77)  results 

Et  =  C  -  Co ,  (C„  +  Df1  CL  =  C  -  Fcl*  -  C  -  (AT  I)"1  A  C"1 )~!  AT  D"1  A  C 

=  [1  -  (AT  D-1  A  +  C"lf 1  AT  D_1  A]C  =  (At  D'1  A  +  C-1rt(AT  D'1A  +  C_1)-ATD_1AJC 

=  (AT  D"1  A  +  c-1)'1  (2. 101) 

In  the  case  of  the  best  unbiased  estimator  C'1  is  not  present  in  the  normal 
matrix,  so  (2. 101)  becomes 

Et  =  (At  D_1  A)-1  =  G"1  (2. 102) 


which  is  the  well  known  expression  of  the  error  matrix  for  ordinary  least 
squares. 

Because  of  the  block -diagonal  structure  of  the  variance-covariance  matrix!1' 
the  estimates  of  c“  of  different  orders  are  uncorrelated.  The  diagonal 
elements  of  the  variance-covariance  matrix  are  the  variances  of  the  estimated 
coefficients  total  eri'ors  (i.e. ,  sampling  plus  propagated  noise),  in  (2. 101). 

Obtaining  the  variances  of  the  CnB  in  band-limited  collocation  is  formally 
identical  to  getting  the  variances  of  the  estimates  in  ordinary  least  squares, 
according  to  (2.101)  and  to  (2. 102) 


2.18  The  Use  of  a  priori  Information  on  the  Coefficients 

Assume  that  all  coefficients  up  to  some  degree  and  order  M  are 
approximately  known,  and  that  Q,  is  the  variance-covariance  matrix  of  their 
errors.  This  could  be  the  case  where  a  model  of  the  gravity  field  has  been 
obtained  from  data  gathered  using  artificial  satellites,  complete  to  degree  M  , 
and  terrestrial  data  is  to  be  used  to  Improve  the  existing  coefficients  and  obtain 
new  ones  beyond  degree  M  . 

Three  possible  approaches  to  this  question  will  be  discussed  here,  using 
a  "point  values"  formulation  in  the  first,  two  cases  for  simplicity. 


When  the  data  set  is  not  sparse. 
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The  terrestrial  data  can  be  used  separately  to  obtain  a  model  to  degree 
and  order  MT  >  M  ,  together  with  the  variance-covariance  matrix  Qr  of 
the  coefficients'  errors.  To  combine  the  satellite  and  the  terrestrial  coef¬ 
ficients  one  can  set  up  the  following  observation  equations 
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(2. 103) 


where  c  is  the  vector  potential  coefficients  and  I,  is  the  (M  +  l)2  x  (M  +  l)a 
unit  matrix  augmented  with  zeroes  on  the  right,  and  IT  is  the  (Mt  +  l)2  x 
(Mr  +  l)3  unit  matrix,  while  c,  is  the  vector  of  featellite'boefficients  and 
c  t  the  vector  of 'terrestrial'  fcoefflc  ients ;  £  and  _t  are  the  corresponding 
vectors  of  residuals.  The  best  linear  unbiased  estimator  for  the  combined 
system  of  observations  is 
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(2. 104) 


If  Q,  and  QT  are  the  inverses  of  ordinary  least  squares  normal  ma¬ 
trices  G,  and  Gt  ,  then  the  error  matrix  of  the  combined  solution 

E  =  <Q-.4  +  QTtV1  =*  (G,  h-Gt)"1 

corresponds  to  the  propagated  noise  only.  If  they  are  "collocation" 
matrices  of  the  type  (AT  D"1  A  +  C’V^see  expression  (2. 77)),  then  the  error 
matrix  Includes  the  effect  of  the  sampling  error  as  well.  Most  satellite  models  are, 
to  date  ,  'least  squares-type"  and  it  would  be  incongrous  to  combine  them 
with  "collocation-type"  models,  terrestrial  or  otherwise.  The  problem  need 
not  be  a  serious  one,  because  geodetic  spacecrafts  so  far  have  orbited  at 
altitudes  of  800  km  or  more,  where  the  field  is  much  smoother  than  at  the 
surface,  so  the  sampling  errors  are  bound  to  be  small  compared  to  the  pro¬ 
pagated  data  errors  reflected  by  Q,  . 

In  the  case  where  the  terrestrial  model  has  been  derived  from  a  reg¬ 
ularly  sampled  data  set  using  the  "band-limited  approach"  of  previous  para¬ 
graphs,  Qt  is  block -diagonal,  and  those  blocks  corresponding  to  orders 
m  >  M  art?  identical  to  the  corresponding  blocks  in  (Q71  +  (g1  )-1  .  From 
this  it  is  not  difficult  to  conclude  that  the  coefficients  in  the  combined  model 
up  to  order  M  will  be  somewhat  different  (and  presumably  better)  than  those 
in  either  the  satellite  or  the  terrestrial  sets,  while  those  above  M  will  be 
identical  to  the  corresponding  terrestrial  coefficients. 


(b)  A  priori  values  included  as  data  in  the  adjustment: 
Consider  the  system  of  observation  equations 
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which  is  the  system  (2.66)  augmented  with  equations  of  the  type 
^m(S)  -  e^p)  =  Ca“ 

where  e£»(s)  the  error  in  the  "satellite  model"  coefficient  C . 

The  normal  matrix  of  collocation  is 

E71  =  ([AT  lt ]  TD"1  O  IfA-,  +  C'1  )  (2.106) 

-O  Qf^Ll.J 

and  the  optimal  estimator  of  the  band-limited  type  is 

c  -  (Ar  D"1  A  +  Q71  +  C" V1  (AT  D"1  m  +  Q?  c.)  (2. 107) 

Once  more,  if  the  data  in  m  has  been  sampled  regularly  on  the  sphere, 
the  estimated  coefficients  will  be  affected  by  the  existence  of  a  priori  values 
only  if  their  order  is  no  higher  than  M  .  Naturally,  this  is  a  desirable  sit¬ 
uation.  Also  it  is  important  that  the  error  measures  corresponding  to  ET 
and  Q(  be  congruous,  though  this  is  probably  not  very  Important  in  the  case 
of  satellite  models  obtained  with  high-orbiting  spacecraft.  Notice  that  (a) 
and  (b)  are  equivalent  when  Qr  =  (AT  ET  1  A)-1  and  when  C-1  is  excluded 
from  (2, 106)-(2. 107).  In  other  words:  these  first  two  approaches  are  equiv¬ 
alent  for  ordinary  least  squares. 

(c)  The  method  of  Kaula  and  Rapp? 

W.  Kaula  (1966)  proposed  a  technique  for  simultaneously  filtering  er¬ 
rors  out  of  a  terrestrial  data  set  and  improving  Ihe  coefficients  of  a  satellite 
model.  This  method  was  later  developed  by  R.  Rapp  (1968),  who  more  recently 
(1978)  used  it  to  Improve  a  global  data  set  of  mean  1°  x  1°  anomalies  by  com¬ 
bining  it  with  the  potential  coefficients  of  the  GEM-9  model.  This  adjusted 
data  set  was  used  by  the  author  of  this  report  to  create  the  5°  x  5°  mean  anom¬ 
alies  analysed  in  one  of  the  numerical  experiments  of  section  3. 

The  idea  is  to  satisfy  condition  equations  of  the  type 

-  (4 Try  (n  -  1)  j3a  S9*")-1  Z  L  I  ^f(M)d<*(AgtJ  +  vtJ)  *  0 

i*o  j*o  »frt }  (2.108) 
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while  minimizing  the  quadratic  form 


g(d,  v)  =  dT  Q-,1  d  +  vT  D"1  v  (2. 109) 

where  v  is  the  vector  of  corrections  vt  3  to  the  mean  gravity  anomalies 
Agij  ,  and  d  is  the  vector  of  corrections  to  the  satellite  potential  coef¬ 
ficients  &  is  the  nth  degree  Pellinen  factor  discussed  in  paragraph 

(4. 3),  y  is  the  mean  value  of  equatorial  gravity,  and  S  is  the  ratio  between 
the  radius  of  the  Earth's  largest  inner  geocentric  sphere  and  the  mean  Earth 
radius.  In  matrix  form,  the  condition  equations  (2. 108)  are 

c,  +  d  -  At  5g  -  At  v  =  0  (2. 110) 

where  A  is  a  2N2  x  (M  +  1) 3  matrix  having  columns  a^„  of  the  form 

ap“  =  (4n(n  -ljyS^&f1  [f  Yn£(9,A)da.  .  .  f  YB“(9,A)dcr)T 

•)Oq  o  vO N_1  3M_1 

so,  except  for  the  factor  (4rr(n  -  l)ySn+a  j3n)-1  ,  A  is  the  "area  means  version" 
of  the  matrix  of  system  (2. 66),  and  has  the  same  properties  as  the  "A"  mat¬ 
rices  considered  so  far. 


The  optimal  estimates  are  given  by  the  expressions 
c  =  c.  +  d  (2. 111-a) 


=  Ag  +  v 


(2,  lll-b) 


where 

d  =  -((A  D  A)"1  +  Q71)"1  (At  D  A)“l  (c,  -  A*  5g) 


(g.  112) 


and 

v  =  I)  A  Q,  d  (2.113) 

If  the  data  set  is  both  complete  and  of  uniform  quality,  the  matrix  ATDA 
has  the  block  structure  first  discussed  in  paragraph  (2.15).  The  presence  of 
D  Instead  of  D'1  makes  no  difference  to  the  calculations  needed  to  set  up 
and  invert  the  matrix:  the  procedures  are  those  already  explained.  Ter¬ 
restrial  data,  however,  is  usually  both  scattered  and  of  varying  quality 
(i.e.,  different  noise  variances).  For  this  type  of  data,  therefore,  the  meth¬ 
ods  for  scattered  measurements  given  in  paragraph  (2,16)  could  be  used. 


2. 19  Optimal  Estimation  over  a  Band  of  Spatial  Frequencies 
Assume  that  the  signal  is  of  the  type 
m  =  A  c  +  n 
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where  A  and  c  may  now  he  infinite  (i.e.,  all  degrees  from  0  to  * 

maybe  present).  If  c '  is  a  sub-vector  of  c  comprising,  say,  the  first 

1?  coefficients,  and  if  £  is  a  vector  of  estimates  of  a  function  s  at 
a  given  set  of  points  on  the  sphere,  such  that  the  values  of  s  depend  only  on  those 
of  c'  according  to  the  relationship 

8  ■  Be1  (2. 114) 

where  B  is  some  matrix  of  appropriate  size,  not  necessarily  of  the  same  type 
as  A  ,  then  the  optimal  estimator  for  £  is,  according  to  (2.39),  (2.75), 

i  =  C,.(C,*  +  Df 1  m 

=  M{s  zTl  (C„  +  Df1  m  =  M{B  o'  c'  A  (C„  +  Dfx } 

=  Be' A1* (C„  +  D)"1  m  (wnere  C’  =  M-fc*  c,T  i-  y 

=  B  C0-  X  (Cr,  +  Df 1  m  1  j 

or 

|  »  Be-  (2.115) 

according  to  (2.40). 

Expression  (2. 115)  Indicates  that  the  optimal  estimates  of  a  band- 
limited  function  s  from  data  m  are  identical  to  the  values  of  a  obtained 
from  the  optimal  estimates  of  the  coefficients  c'  by  means  of  the  rela¬ 
tionship  1=  b£  . 


-67- 


3.  Numerical  Examples 

This  section  presents  several  computed  examples  to  Illustrate  some 
of  the  Ideas  and  methods  discussed  earlier  on.  The  question  posed  here  is, 
basically,  that  of  the  accuracy  of  the  various  procedures,  and  Is  answered 
by  means  of  error  analysis  carried  out  with  the  formulas  for  error  variance 
developed  in  section  2,  and  also  by  analysing  simulated  data  and  comparing 
the  recovered  coefficients  to  the  original  ones  in  order  to  find  the  actual 
errors.  A  comparison  of  the  rms  of  these  errors  with  the  theoretical  rms 
(i.e.,  the  square  root  of  the  variance)  provides  both  a  check  on  each  set  of 
results  and,  more  important,  shows  just  how  adequate  an  error  measure 
the  theoretical  rms  can  be.  Besides  error  analysis  and  simulations, this 
section  shows.  In  the  last  paragraph,  the  results  of  the  harmonic  analysis 
of  a  real  data  set:  a  5°  x  5°  equal  angular  set  of  mean  gravity  anomalies 
covering  the  whole  Earth,  from  which  the  coefficients  of  the  disturbing 
potential  have  been  recovered  to  degree  and  order  36  by  means  of  least  squares 
collocation,  using  the  "Toeplitz  matrix"  approach  of  paragraphs  (2. 10)  and 
(2.11). 


3.1  Generation  and  Analysis  of  Simulated  Data 

As  explained  in  the  proceeding  section,  the  variance  of  the  error  in 
the  estimate  of  C®  depends  on  the  power  spectrum  (or  covariance  function) 
of  the  signal,  and  on  the  variance-covariance  matrix  of  the  noise.  The  pro¬ 
pagation  of  the  noise  is  quite  straightforward,  and  anybody  who  has  had  any 
practical  experience  with  adjustments  of  geodetic  networks  and  the  like  al¬ 
ready  has  enough  "feeling"  for  this  part  of  the  error  measure,  and  is  cap¬ 
able  of  understanding  its  significance  when  its  value  is  given  to  him.  The 
part  corresponding  to  the  sampling  error  is  somewhat  different;  it  involves 
a  rather  unusual  geometric  average  over  rotations,  and  this  type  of  error 
measure,  while  not  exactly  new  (collocation,  based  on  this  measure,  has 
been  around  since  the  mid-sixties)  is  not  so  familiar  to  geodesists  yet,  and 
its  use, in  harmonic  analysis  in  particular,  far  from  common  practice.  For 
this  reason,  it  Is  probably  fair  to  the  readers  to  provide  some  illustration  of 
how  "close"  this  part  of  the  error  measure  is  to  the  actual  sampling  error 
that  occurs  when  data  of  the  assumed  power  spectrum  is  analyzed  in  any  of 
the  ways  discussed  so  far  to  recover  spherical  harmonic  coefficients.  By 
"close"  one  means  that  the  actual  numbers  measuring  the  theoretical  and  the 
actual  variances  (or  rms)  should  differ  from  each  other  by  a  small  percentage, 
or  some  equally  clear-cut  criterion. 

The  theoretical  variance  considered  here  is  the  variance  of  the  esti¬ 
mation  errors  per  degree  (T^n ,  defined  in  terms  of  the  error  measure  of 
section  2  as  fallows  (see  paragraph  (2.8)) 
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The  rms  of  the  error  is  the  square  root  of  this  variance,  and  the  ratio  of 
this  rms  to  the  rms  per  coefficient: 
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multiplied  by  100  ,  or  the  percentage  rms  error  per  degree,  is  the  theo¬ 
retical  quantity  to  be  compared  to  the  "actual”  percentage  rms  error  per 
degree  derived  from  the  analysis  of  simulated  data  with  the  same  statistical 
characteristics  (i.e. ,  a3',  ,  C,t ,  and  D)  in  formula  (3.2). 

The  a|n  were  computed  with  subroutine  NORMAX  (Appendix  B). 


To  obtain  the  actual  percentage  rms  error  per  degree,  sets  of  simulated 
data  were  created  on  full  regular  grids  as  follows!  the  artificial  data  con¬ 
sisted  of  area  means  computed  globally,  using  the  algorithm  outlined  in  para¬ 
graph  (1. 7)  and  subroutine  SSYNTH  (Appendix  B),  on  the  basis  of  expression 
(1.2).  The  C“  ,  complete  to  degree  and  order  N„x  »  N  =  came  from 

sequences  of  random  numbers.  The  random  numbers,  obtained  using  the  IMSL 
subroutine  "GGNOR"  with  gene  rating  rseeds”of  the  order  of  104  ,  were  sealed 
to  give  them  the  desired  degree  variances  or„a  .  For  each  simulation,  a  se¬ 
quence  of  (NBaX  +  l)3  numbers  was  obtained,  the  first  corresponding  to  c0o  , 
the  second,  third,  and  fourth  to  C 1 0 ,  Cu  ,  and  Sn  ,  respectively,  and  so 
forth.  If  rn0  ,  r®!  ,  .  .  .  rn“  were  the  (2n  +  1)  numbers  corresponding  to 
degree  n  ,  then  the  scaling  that  resulted  In  the  corresponding  CnaB  was 


(3. 3) 


The  harmonic  coefficients  C®  obtained  in  this  way  were  the  ”actual"  coef¬ 
ficients  to  which  the  recovered  by  some  of  the  procedures  described 
in  section  2,were  then  compared  to  obtain  the  actual  percentage  rms  errors 
per  degree 

Cn  =  l  t  I  (££  "  X  100  (3.4) 


The  analysis  of  tho  simulated  data  was  done  with  subroutine  HARM  IN  (appendix 
B).  This  type  of  numerical  experiment  was  carried  out  three  or  more  times 
in  each  case,  varying  only  the  seed  used  to  generate  the  random  numbers, much 
as  a  Montecarlo-type  of  analysis  Is  conducted.  The  seeds  were  chosen  widely 
apart,  to  ensure  that  the  correlation  between  "trials”  would  be  virtually  nil. 
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The  maximum  degree  and  order  in  the  set  of  artificial  coefficients, 

NBa  x  »  was  chosen  so  that  the  power  in  the  mean  values  above  degree  NBa  » 

2000 

Pn..x  =  f  ft  C,*  (3.5) 

k>tl 

were  lessthanl%  of  the  power  between  degrees  0  and  2000.  The  ^ 
are  the  Pellinen  coefficients»discussed  in  paragraph  (4.3),  corresponding 
to  5°  x  5°  area  means.  The  values  used  for  the  degree  variances  crn  had  been 
empirically  obtained  from  terrestrial  data  in  the  manner  described  below. 

The  data  were  supposed  to  be  noise-free,  as  only  the  sampling  part  of  the 
error  was  studied  in  this  way,  for  the  x'easons  given  at  the  beginning  of  this 
paragraph.  The  estimators  being  linear,  the  propagated  noise  and  the  sampling 
error  merely  add  arithmetically  to  each  other,  and  can  therefore  be  studied 
separately,  if  so  desired.  Summing  up.  It  can  be  said  that  the  simulated 
data  consisted  in  global  data  sets  of  artificial  gravity  anomalies,  averaged 
over  equal  angular  grids. 


The  empirical  degree  variances  were  obtained  as  follows:  up  to  degree 
100  they  were  those  Implied  by  a  set  of  coefficients,  complete  to  degree  180, 
obtained  by  R.  Rapp  and  associates  at  O.S.U.  from  a  global  data  set  of 
1°  x  1°  mean  anomalies.  Above  degree  100,  the  crB  (Ag)  were  obtained  from 
a  model  of  the  form 


<V(Ag)  =  (n 


1  <n  +  A) 


(3.6) 


(Moritz,  1976),  where  the  parameters  ,  as  ,  Sa  ,  Sa  ,  A  and  B  have 
been  adjusted  to  fit  existing  gravimetric  data,  satellite  altimetry,  satellite 
field  models,  and  other  geophysical  data.  The  parameters  used  in  most 
examples  were 


=  3.4050  =  0.998006  A  =  1. 

a  3  =  140.03  S9  =  0.914232  B  =  2. 


corresponding  to  the  best  model  of  this  type  given  in  a  report  by  R.  Rapp 
(1979)  who,  in  the  same  work,  discusses  also  the  empirical  degree  variances 
obtained  from  his  180,  180  field  model.  The  degree  variances  implied  by  (3-6) 
with  the  parameters  listed  above  are  also  very  similar  to  those  obtained  by  quite 
different  means  by  Wagner  and  Colombo  (1979),  who  analyzed  the  (Fourier)  power 
spectrum  of  short  arcs  of  GEOS-3  altimetry,  and  converted  their  average  to 
a  spherical  harmonics  spectrum  using  formulas  that  follow  from  the 
relationship  between  spherical  harmonics  and  Fourier  series.  The  empir¬ 
ical  variances  for  n  £  100  are  included  in  the  listing  of  subroutine  NVAR, 

In  Appendix  B. 

In  order  to  understand  how  critical  the  choise  of  empirical  degree  variances 
is  to  the  theoretical  and  actual  errors,  a  different  two-term  model  obtained 
by  C.  Jekeli  (1978)  was  used  as  well.  This  model  has  the  following  parameter 
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values 


=  18.3906  Si  =  0.9943667  A  =  140. 

OCa  =  658.6132  S3  =  0.9048949  B  =  10. 

All  the  examples  considered  In  this  section  refer  to  complete  equal  angular 
sets  of  mean  values  with  the  same  statistical  properties  of  terrestrial  gravity 
anomalies  (as  far  as  such  properties  are  known).  The  analysis  of  actual 
mean  gravity  anomalies  is  shown  in  the  last  paragraph.  Besides  being  im¬ 
portant  in  geodetic  studies,  gravity  anomalies  constitute  a  type  of  geophysical 
data  with  reasonably  well  known  statistical  properties,  and  their  study  here 
is  meant  to  give  the  reader  some  idea  of  how  effective  are  the  ideas  presented 
earlier  when  it  comes  to  handling  "real  data"  (  or  something  resembling  it). 


3.2  Agreement  between  the  Actual  and  the  Theoretical  Measures  of  the 
Sampling  Errors 

Table  (3. 1)  lists  side  by  side  the  theoretical  precentage  rms  per  degree 
of  the  sampling  error  according  to(3.2)  and  the  actual  value  of  this  percen¬ 
tage  for  two  different  sets  of  coefficients  (i.e.,  from  random  sequences  with 
different  seeds).  The  coefficients  were  recovered  using  the  quadratures  for¬ 
mula 

^  j  fjr-  X  P 

<£.  -  dr  E  £  (e.Mdc- 

P*  iso  •'Olj 

This  type  of  formulas  has  been  discussed  in  paragraph  (2.6).  The  simulated 
data  consisted  in  full  sets  of  5°  x  5°  mean  anomalies  obtained  from  harmonic 
coefficients  complete  to  degree  and  order  N»»*  -  140  .  The  power  above 
degree  140  in  5°x  5°  anomalies  is  negligible,  according  to  the  empirical 
power  spectrum  model  that  was  used.  The  results  shown  here  are  fairly 
typical  of  similar  tests  conducted  with  other  quadrature  formulas,  so  the 
conclusions  that  can  be  drawn  are  likely  to  be  valid  for  the  analysis  of  area 
t  means  by  numerical  quadratures  in  general.  There  is  clear  agreement 

|  between  the  theoretical  and  the  actual  rms  of  the  errors,  and  not  just  the 

[  average  rms  of  actual  errors,  but  the  actual  rms  of  each  trial  as  well.  The 

agreement  is  close,  and  the  reader  will  probably  agree  that  to  use  a  theoretical 
error  measure  that  can  predict  the  actual  error  so  well  is  a  meaningful  way 
of  quantifying  the  error. 


3.3  Accuracies  of  Various  Quadratures  Formulas 

Five  quadratures  formulas  for  area  means  have  been  studied:  the  first 
four  of  the  type 

n-i  an- 1 


n-i  aft- 1  * 

Z  Z  Y®  (9,A)dq  Ag„ 

i-o  «0  Jcl} 
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Table  3,1 

Comparison  of  actual  versus  theoretical  percentage  rms 
error  per  degree. 

5°  x  5°  mean  anomalies,  N„x  =  140,  0.  mgal  rms  noise. 


n  Actual,  No  1  Actual,  No  2  Average  Theoretical  (expres- 
(seed=53218)  (seed  =  31765)  1  and  2  sion  (3.2)  xlOO) 


2 

!  0.50 

0.  62 

1  0. 56 

0.51 

5 

1.06 

1.10 

1  1.08 

1 . 23 

10 

4.99 

5.  81 

5.40 

4.89 

15 

13.06 

11.03 

12.05 

12.75 

20 

23.48 

25.97 

24.  72 

24.  77 

25 

23.04 

30.82 

26.93 

30. 03 

30 

45.84 

46.26 

46.05 

43.28 

*36(N) 

55.33 

61.10 

58.22 

60.35 

40 

70.61 

75.82 

73.22 

75.14 

45 

85.91 

86.41 

86.16 

87.03 

50 

101.51 

101.43 

101.47 

101.47 

differ  only  in  : 

(a)  /xn  =  the  optimal  de-smoothing  factor  given  by  expression  (2. 3G)  in  para¬ 

graph  (2.7): 

(b)  Mn  =  4^f 

(C)  4^ 

(d) 

(e)  The  optimal  quadratures  -  type  formula,  in  the  least 

squares  collocation  sense  (i.e.,  minimum  combined  error  measure)  for 
the  given  grid;  si<”  >1  and  noise  (paragraph  (2. 8)).  The  grid  was  equal 
angular  in  all  five  ,es.  Table  (3.2)  compares  the  percentage  rms  of  the 
errors  per  degree  for  a  30°  x  30°  grid;  table  (3. 3)  corresponds  to  a  10°  x  10° 
grid  ;  and  table!  (3.4)  to  a  5°  x  5°  grid.  N„,x  was  100  for  the  first  two  tables, 

'* n«!  140  for  the  last.  All  these  values  are  theoretical,  computed  in  accordance 
r  ‘  i. •  f-innu !:«.*;  La  paragraph  (3.1).  In  all  three  cases  noise  is  not  present, 

•  ■  rr  rs  are  purely  sampling  errors. 

•  i  :  I  i.  5)  corresponds  to  a  5°  x  5°  grid  and  an  uniform  noise  of 
'■  *!-'■  •  ffoc  t  of  the  noise  has  been  included  in  the  results. 

.  <yoe  correlation  coefficients  for  the  various  methods, 

•  .  ■  ■  '■  rri  lat  ton  coefficient  for  the  nth  dcgx’ce  is  defined  as 


.  *♦ 


it  <d. 


(3.7) 


P»  =  l 


i  t  cf.  4“  <£  t 

I  =  o  Q(=  0  *  =  0  fits  n 


«a9 


*=  0  flja  g 


and  it  is  also  equal  to 


A  = 

r  U  Ag„  Afe,d  a 

~Jcr  Ag?dc r/ff  Agf  da- 

( i  1 4?.a  r‘  l4 

a  =0  /v=  O 


(3.8) 


This  coefficient  can  be  regarded  either  a  a  measure  of  the  agreement  between 
the  actual  and  the  recovered  coefficients  of  the  nth  harmonic,  or  of  between 
the  nth  harmonic  Ag„  in  the  signal  and  the  harmonic  Ag„  that  can  be  computed 
from  the  recovered  coefficients,  if  the  could  be  seen  as  random  variables 
with  gaussian  distribution,  the  interpretation  of  ft  would  move  along  well-worn 
paths;  however,  as  it  was  mentioned  in  paragraph  (2. 5),  there  are  some  unex¬ 
pected  problems  when  extending  the  idea  of  a  gaussian  random  process  to  the 
sphere,  so  is  better  to  chose  another  approach.  One  could  regard  the  coef¬ 
ficients  of  the  nth  harmonic  as  the  coordinates  of  a  vector  in  (2n  +  1) -dimen¬ 
sional  euclidean  space.  The  actual  coefficients  will  define  thus  one  vector, 
and  the  recovered  coefficients  another.  Expression  (3. 7)  then  merely  defines 
ft,  as  the  scalar  product  of  this  two  vectors.  Likewise,  expression  (3.8)  is 
that  of  the  scalar  product  of  two  elements  of  a  function  space.  The  angle 
formed  by  these  vectors  is  0°  when  correlation  is  (maximum),  and  90°  for 
0  correlation;  a  minimum  correlation  of  -1  corresponds  to  the  case  when 
the  vectors  are  equal  but  of  opposite  sense.  The  scalar  product  is  independent 
of  any  scale  factors  that  may  multiply  the  vectors:  it  depends  only  on  their 
mutual  orientation.  For  this  reason,  the  correlation  coefficients  are  the  same 
for  the  four  quadratures  formulas  of  the  type 


* 


N*1 

■  «*F 
1  =  0 


l 


SN-  1 

r  Agi,/  7a~  (6,  X)  dc 

J  =  o 


because  the  difference  between  the  errors  for  the  same  harmonic,  predicted  with 
two  different  formulas  of  this  kind,  consists  in  a  scale  factor  . 

Clearly,  as  the  rms  of  the  error  Increases  with  n  ,  ft,  decreases  from  almost 
1  where  the  error  is  smallest  (very  low  degrees), to  below  0. 5  where  the 
error  exceeds  90%  (highest  degrees  analyzed). 


Observing  the  percentual  rms  of  the  errors  in  the  first  four  tables,  it 
is  easy  to  see  that  they  by  no  means  reach  100%  as  soon  as  the  Nyquist 
frequency  is  reached  (n  =  N),  but  that  they  remain  substantially  smaller  than 
100%  even  at  degrees  considerably  higher  than  N  ;  this  is  in  line  with  the 
conclusions  in  paragraph  (1. 3).  The  optimal  estimator  itself  cannot  have  an 
error  larger  than  10 0%  ,  be  it  due  to  sampling,  noise,  or  both.  Otherwise, 
a  null  estimator  (one  that  predicts  only  zeroes)  would  be  better  than  the  op¬ 
timal,  which  is  not  possible. 


Table  (3.7),  compares  the  theoretical  errors  with  zero  noise  (i.e. , 
the  sampling  errors)  of  the  collocation  estimator  obtained,  first  according  to 
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<7*  implied  by  R.  Rapp's  model  (used  in  all  the  other  tables),  and  then  ac¬ 
cording  to  the  ”2L"  model  of  C.  Jeleky,  both  described  in  paragraph  (3.1). 

This  table  is  included  here  to  give  the  reader  an  idea  of  how  sensitive  the  theo¬ 
retical  error  variances  are  to  the  empirical  degree  variances  used  to  compute 
them.  The  "2L"  model  has  considerably  more  power  than  Rapp's  at  high  degrees, 
and  this  may  be  reflected  in  the  somewhat  larger  errors  in  the  corresponding 
column  of  the  table. 


Table  3.2 

Theoretical  percentage  rms  error  per  degree. 


30°  x  30°  mean  anomalies. 


=  100,  0.  mgal  rms  noise. 


Optimal 

Estimator 


13. 

,17 

25. 

,60 

65. 

,13 

96. 

,01 

99. 

,27 

1  _1_ 

4tt)3b  4h  4rrjSaa 


13.57 
28.72 
101.41 

194.08  97.13  882.98 

915.49  105.44  17835.58 


Table  3.3 

Theoretical  percentage  rms  error  per  d  egree. 

10°  x  10°  mean  anomalies,  N,«  x  =  100  ,  0.  mgal  rms  noise. 


Optimal 

Estim. 


1.52 
2.24 
4.27 
8.74 
13.19 
31.60 
40.43 
41.  41 


* 

Mb 

4tt 

1 

4  TT 

1 

4  TTjSf 

1.63 

1.71 

2.04 

1.65 

2.46 

3.00 

4.76 

2.56 

4.68 

5.83 

9.51 

4.94 

9.60 

10.99 

16.39 

10.16 

14.02 

15.  89 

23.68 

15.19 

33.03 

33.10 

37.84 

37.43 

41.41 

41.41 

46.45 

49.91 

42.18 

42.36 

51.41 

53. 12 

59.94 

61.26 

63.41 

88.17 

78.99 

94.69 

79.07 

158.26 

78.54 

89.75 

79.05 

167.  49 

87.40 

116.57 

87.59 

258.09 

94. 03 

164. 50 

97.85 

443.76 

95.98 

193.17 

100.35 

646.88 

97.30 

232.24 

101.88 

1010. 13 

98.30 

303.  89 

103.49 

1828.98 

97.40 

281.73 

97.77 

2619.98 

98.08 

485.72 

98.45 

'  8694.11 

Table  3.4 

Theoretical  percentage  rms  error  per  degree 


5°  x  5°  mean  anomalies, 


Optimal 

n  Estim.  u. 


140  ,  0.  mgal  rms  noise. 


0.47 

1.36 

9.74 

15.14 

30.58 

42.88 


0.49 
1.44 
10.11 
15.  55 
31.06 
43.28 
59.15 
73.51 
83.62 


Table  3.  5 

Theoretical  percentage  rms  per  degree 

5°  x  5°  mean  anomalies,  Na»  x  =  140  ,  5.  mgal  rms  noise. 

n 

Optimal 

Estim. 

+ 

/in 

-1  - 
4tt  J3n 

1 

4  TT 

2 

8.74 

8.81 

8.84 

8.83 

8.85 

6 

9.33 

9.34 

9.34 

9.43 

9.41 

12 

32.80 

32.88 

33.79 

33.02 

35.28 

18 

35.94 

36.04 

36. 54 

36.25 

39.86 

24 

53.32 

53.47 

56.73 

53.49 

66.99 

30 

61.97 

62.14 

67.15 

62.15 

87.22 

36  (N 

71.98 

72.62 

82,81 

72.63 

122.67 

42 

81.55 

82.17 

102.91 

82.54 

181.19 

48  j 

88.72 

124.09 

89.54 

271.87 

Table  3.  6 

Correlation  factor  per  degree. 

5°  x  5°  mean  anomalies,  Nns  ,=  140,  0.  mgal  noise. 


n  Optimal  Estim.  Simple  quadratures 


2 

6 

12 

18 

24 

30 

36  (N) 
42 

48 


Table  3.  7 

Comparison  between  the  theoretical  errors  (optimal  esti- 
mator)when  different  covariance  functions  are  assumed. 
Naas  =■  140  ,  5°  x  5°  mean  anomalies,  0.  mgal  noise. 


The  results  in  the  tables  show  that  the  optimal  estimator  errors  are  the 
smallest,  as  expected,  and  that  the  quadratures  formula  with  the  optimal  de¬ 
smoothing  factors  Is  the  best  of  the  four  simple  quadratures  formulas  compared  here, 
though  it  is  not  quite  as  good  as  the  optimal  estimator,  also  as  expected.  In 
the  case  of  zero  noise  each  of  the  three  non-optimal  quadratures  formulas 
has  errors  that,  in  some  region  of  the  spectrum,  are  smaller  than  those  for 
the  other  two;  concretely,  the  de-smoothing  factor  works  better 

for  low  degree  harmonics  (or  n  s;  -J-N),  i.e. ,  the  percentage  Sins  of  the 
error  is  less  than  for  the  other  two  formulas  there,  the  factor  pa  = 
is  best  for  middle  harmonics  (approximately  l/3N<n  £  N),  and 
/i„  =  is  best  above  the  Nyquist  frequency  N  .  It  is  possible,  therefore, 
to  obtain  a  simple  "composite"  quadratures  formula  that  combines  the  good 
properties  of  all  three  formulas,  by  defining  its  de-smoothing  factor  as  follows; 

|  )3a  if  0  s  n  *  1/3N 

M a  =  where  rja  *  l  /3a  if  1/3N  <n  s  N  (3.9) 

Va  (l  if  n  >  N 


This  composite  formula  has  been  implemented  in  the  version  of-subroutine 
HARMIN  listed  in  Appendix  B,  through  the  subroutine  can  be  easily  changed 
to  compute  other  quadrature  formulas. 


It  is  clear  from  the  tables  that,  while  better  than  all  the  others,  the 
optimal  formula  is  only  marginally  so;  from  a  practical  point  of  view,  the 
simple  "composite"  formula  (3.9)  above  Is  virtually  as  good,  but  it  is  much 
easier  to  implement  and  compute.  Therefore,  when  analysing  data  of  the  type 
considered  here  (resembling  mean  anomalies  with  uniform  noise  on  an  equal 
angular  grid),  the  quadratures  formulas  discussed  the  composite  In 
particular,  are  about  as  good  as  any  linear  technique  for  estimating  the  coef¬ 
ficients,  and  also  very  easy  to  program  and  very  efficient. 


There  may  be  cases,  however,  when  this  Is  not  true.  If  the  data  set 
were  very  noisy  and/or  Incomplete,  or  if  the  coefficients  to  be  recovered  were 
not  those  of  the  signal,  but  those  of  some  complicated  transformation  of  it 
(as  in  the  case  of  satellite-to-satellite  tracking  data,  where  the  signal  depends 
on  a  combination  of  differences  ox  raaiai  and  horizontal  derivatives  of  the  grav¬ 
itational  potential)  so  no  simple  Integral  formula  like  (1.4)  exists  that  can  be 
readily  discretized  into  summations  like  (1.5)  or  (1.6),  then  simple  quadrature 
formulas  like  those  in  section  1  would  be  of  no  real  use,  and  the  optimal  es¬ 
timator  could  provide  the  only  practical  way  of  obtaining  (he  coefficients.  Both 
collocation  and  least  squares  adjustment  are  very  similar,  as  shown  in  section 
2,  and  both  can  be  implemented  with  reasonable  efficiency  if  nothing  simpler 
is  available.  More  important,  the  optimal  formulas  provide  a  theoretical 
background  on  which  one  can  build  a  coherent  and  comprehensive  understanding 
of  the  other  linear  techniques  for  spherical  harmonic  analysis.  It  is,  after  all, 
because  of  the  theory  developed  in  the  previcu  s  section  that  it  has  been  possible 
to  obtain  the  results  shown  in  the  proceeding  tables,  results  that  constitute  the 
factual  basis  for  these  considerations. 

The  techniques  for  setting  up  and  inverting  the  variance-covariance 
matrix  of  the  data  are  of  interest  in  a  number  of  estimation  and  filtering 
problems,  besides  harmonic  analysis.  The  author  hopes  that  the  examples 
provided  in  this  section  will  encourage  ihe  wider  use  of  least  squares  col¬ 
location  for  the  processing  of  large,  global  sets  of  data,  both  point  values 
and  area  means.  Creating  the  simulated  mean  anomalies  with  subroutine 
SSYNTH,  obtaining  the  "weights"  x“  of  the  optimal  quadratures  formula  with 
NORMAL^  (Appendix  B),  and  recovering  the  b®,  for  comparison  with  the  or¬ 
iginal  C„  up  to  degree  and  order  72  (same  5000  coefficients)  on  a  5°  x  5°  degree 
grid  (about  2600  "data"  values)  took  less  than  20  seconds,  central  processor 
unit  time,  in  the  AMDHAL  computer  at  Ohio  State.  To  recover  the  harmonic 
coefficients  up  to  degree  and  order  130  from  a  complete  equal  angular  set  of 
1°  x  1°  mean  values  would  require  less  than  two  hours,  using  the  same  machine. 
Most  of  this  time  would  be  dedicated  to  creating  and  inverting  (CIt  +  D). 

Butin  the  optimal  estimation  and  filtering  of  geoidal  undulations,  deflections 
of  the  vertical,  and  any  other  function  of  the  gravity  field  estimable  from  (say) 
gravity  anomalies,  the  fact  that  (C,*  +  D)  can  be  set  up  and  inverted  efficiently 
transcends  harmonic  analysis.  A  major  Implication  Is  that  such  extremely 
large  global  adjustments  require  a  computational  effort  that  is  already  within 
the  reach  of  most  researchers. 

As  mentioned,  already  In  section  2,  matrix  (Clf  +  D)  may  become  poorer 
conditioned,  i.  e.,  its  numerical  inversion  less  stable,  as  the  data  distribution 
becomes  denser.  This  tendency  towards  instability  was  noticed:  when  there  was 
no  noise  (D  =  0)  the  B(m)  matrices  had  to  be  regularized  by  adding  a  small 
positive  constant  k  to  each  diagonal  term  (Colombo,  1979,  par.  (4. 5))  before 
they  could  be  Inverted  successfully.  This  constant,  which  in  most  cases  was 
much  smaller  than  the  diagonal  elements  it  was  being  added  to,  was  lCf6  for 
30°  x  30°  and  10°  x  10°  ,  but  had  to  be  increased  to  10*°  in  the  case  of  a 
5C  x  5°  grid.  When  noise  was  present,  the  nonzero  diagonal  elements  in  D 
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were  sufficient  to  provide  stability,  and  no  regularization  was  needed.  Be¬ 
cause  of  this  tendency  to  instability,  the  "band-limited"  approach  of  paragraphs 
(2. 13)  to  (2. 17)  may  be  preferable,  whenever  it  can  be  properly  applied. 

As  shown  in  Table  (3. 1),  the  theoretical  and  the  actual  sampling  errors 
are  almost  the  same  in  most  cases.  The  propagated  noise  measure  is  very 
easy  to  compute  in  the  case  of  uncorrelated  noise,  and  can  be  added  to  the 
actual  sampling  error  (variance)  to  get  an  estimate  of  the  total  error,  both 
actual  and  theoretical.  This  estimate,  where  the  sampling  part  is  the  result  of 
a  Montecarlo-like  approach,  is  much  easier  to  obtain  than  the  theoretical  one 
that  involves  setting  up  matrix  (C 2Z  +  D),  or  at  least  the  R(in)  matrices.  In  the 
case  of  equal  angular  data  sets  like  those  considered  here,  this  empirical  es¬ 
timate  is  likely  to  be  just  as  accurate,  when  it  comes  to  judge  the  performance 
of  any  given  type  of  harmonic  analysis.  Such  estimate  has  been  used  to  evaluate 
the  likely  errors  in  the  potential  coefficients  obtainqdj^rom  1°  x  1°  mean  anom¬ 
alies  using  the  quadratures  formula  6^  =  £0/ou  vfi  (0,X)dcr  /Sgij  . 

The  Montecarlo  method  described  in  paragraph  ^5".  1)  was  implemented  with  the 
help  of  subroutines  SSYNTH  and  HARMIN,  the  error  variance  being  equated 
to  its  average  over  three  "trials"  (three  sets  of  coefficients  created  from  dif¬ 
ferent  random  sequences),  C.  Jeleli,  also  at  O.S.  U. ,  who  undertook  this  work 
as  part  of  his  own  research.fitted  a  quartic  to  the  percentage  rms  per  degree 
thus  obtained.  When  this  was  expressed  as  a  function  of  the  "normalized  de¬ 
gree”  n/N  ,  the  quartic  fitted  equally  well  the  theoretical  results  for  30°  x  30°  , 
10°  x  10°  ,  and  5°  x  5°  presented  in  this  section,  jekeli's  quartic  expression 
for  the  truncation  error  is  (private  communication) : 

x  100  =  [(-16.19570  (|j)  +  30.34506)  (-^)  +  40.29588  )  (^-)2 
n  (3.10) 

It  is  quite  remarkable  that  such  a  complex  phenomenon  can  be  described  satis¬ 
factorily  by  such  a  simple  law. 

The  expansion  of  the  simulated  1°  x  1°  mean  anomalies  was  complete 
up  to  degree  and  order  N,»x  =  300  .  Creating  (or  analysing)  area  mean  values 
up  to  degree  and  order  N,ftX  =  300  required  about  50  seconds  o.p.u.  time 
using  double  precision  arithmetic  in  the  AMJDHAL  computer  at  O.S.  U. 

Table  3.8  shows  the  actual  percentage  rms  sampling  error  per  degree 
as  computed  in  one  of  the  trials,  and  the  percentage  rms  propagated  noise 
(theoretical)  corresponding  to  a  1  jngal  rms  noise  in  the  data.  Clearly,  the 
errors  are  much  smaller  than  for  any  of  the  cases  considered  previously; 
this  improvement  is  due  to  the  finer  sampling  (the  sampling  error  tends  to 
zero  as  the  area  of  the  blocks  tends  to  zero).  The  data,  however,  tends  to 
be  noisier  when  averaged  on  smaller  blocks,  so  the  propagated  noise  may  in¬ 
crease.  For  a  given  rms  error  in  the  data,  multiply  the  number  in  the  "pro¬ 
pagated  noise"  column  by  this  rms  (in  ragals)  to  obtain  the  corresponding  per¬ 
centage.  These  numbers  arc  only  valid  for  the  estimator  where  pn  =  —1—  . 
Repeated  trials  with  different  random  coefficients  resulted  in  much  4n$» 
the  same  percentages  for  the  sampling  errors,  so  these  values  are  probably  fairly 
typical.  -78- 
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Table  3.8 

Percentage  rms  errors  per  degree  for  a  1°  x  1° 
equal  angular  grid.  ■  jJ*- ,  N,,*  =  300, 

€  Ag  =  1  mgal 

actual 

propagated 

n 

sampling 

error  (theor. ) 

2 

0.01 

0.47 

10 

0.10 

0.72 

20 

0.46 

1.39 

30 

0.98 

1.72 

40 

1.55 

1.93 

50 

2.68 

2.07 

60 

4. 71 

2.20 

70 

6.94 

2.34 

80 

9.43 

2.47 

90 

13. 51 

2.62 

100 

14.79 

2.77 

110 

17.98 

2.96 

120 

25.06 

3.08 

130 

25.33 

3.26 

140 

30.87 

3.43 

150 

37.02 

3. 62 

160 

44.17 

3. 81 

170 

44.43 

4.03 

180  (N) 

53.07 

4.24 

190 

61.87 

4.47 

200 

65.90 

4.73 

A  possible  way  of  bringing  the  sampling  error  down  is  to  use  weighted 
area  means  of  the  form 


where  Ag(k)  is  the  kth  measurement  inside  the  block  ,  and  where  the 
Wk  are  functions  of  the  distance  of  Ag®  to  the  center  of  the  block.  It  the 
Wk  decay  gently  towards  the  border  of  the  area  element,  the  resulting 
weighted  means  will  be  smoother  than  the  ordinary  area  means  considered  so 
far  (all  Wk  =  N^j )  and  their  harmonic  content  above  the  Nyqulst  frequency 
will  be  atenuated.  Consequently,  the  harmonics  below  N  can  be  recovered 
with  less  sampling  error.  This  idea  certainly  deserves  further  study,  ob- 

vkrnsly,  it  is  applicable  only  in  those  oases  where  the  original  measurements 
Agtj  are  available. 


L 


3.4  The  Analysis  of  a  Global  Data  Set  of  5°  x  5°  Mean  Anomalies 


As  a  final  demonstration  of  the  use  of  optimal  estimators,  this  paragraph 
presents  some  results  obtained  by  analysing  a  real  data  set,  consisting  of  5°  x  5° 
mean  gravity  anomalies.  These  anomalies  were  obtained  from  a  global  set 
of  1°  x  1°  mean  values  created  by  R.  Rapp  and  associates  (Rapp,  1978)  from 
the  combination  of  land  and  gravity  measurements,  satellite  altimetry  over 
the  oceans,  and  the  GEM-9  satellite  model  (Lerch  et  al. ,  1979). 

The  5°  x  5°  values,  £g(s°)  »  were  obtained  from  the  1°  x  1°  values, 
using  the  formula 

=  -25"  £ 

where  I  represents  summation  over  all  1°  x  1°  blocks  inside  a  5°  x  5° block. 
The  variances  of  the  were  obtained  from  the  formula 

t,aVr  <£f  £ 

where  the  assumption  has  been  made  that  the  errors  in  the  1°  x  1°  values 
were  uncorrelated,  which  is  not  quite  true,  as  the  values  used  ware  the  pro¬ 
duct  of  an  adjustment.  The  variances  03  Ag.go.  were  different  from  5°  block 

to  5°  block,  but  they  were  "  homogene ized"  as  described  in  paragraph 
(2.9),  in  order  to  obtain  a  (C„,  +  D)  matrix  that  was  easily  invertible  and  re¬ 
sulted  in  a  quadratures-type  estimator  inexpensive  to  implement. 

Figure  (3.1)  shows  a  comparison  between  the  collocation  solution,  com¬ 
plete  to  degree  and  order  36  ,  and  the  same  coefficients  obtained  by  R.  Rapp 
by  numerical  quadratures  from  the  original  1°  x  1°  values.  The  figure  shows 
the  rms  per  degree  (  yo?7(2n  +  l))  for  potential  coefficients  (crna(T)  = 

°n  (Ag)  “f*  »  y  ~  979800 mgal).  The  circles  correspond  to  Rapp's  re¬ 

sults,  and  the  triangles  to  those  obtained  using  collocation  as  already  explained. 
Because  the  grid  used  by  Rapp  is  much  finer,  the  corresponding  results  are 
likely  to  be  less  affected  by  sampling  errors,  at  least  up  to  degree  36  ,  then 
those  obtained  from  the  5°  x  5°  anomalies;  for  this  reason  Rapp's  rms  values 
are  regarded  here  as  the  "true"  ones.  The  solid  line  corresponds  to  c.  Jeleki's 
"2L"  model  for  the  o \  ,  used  here  to  obtain  the  optimal  estimator.  It  is  inter¬ 
esting  to  notice  that  the  triangles  follow  the  circles  (or  "true"  values)  rather 
than  the  line.  A  common  concern  among  those  using  this  type  of  estimators  is 
to  what  extent  the  "a  priori"  power  spectrum  or  covariance  function  used  to  set 
up  the  estimator  may  "bias"  the  results  by  forcing  the  spectrum  of  the  output 
to  resemble  the  "a  priori"  spectrum.  Here  there  is  little  evidence  of  such  a 
"bias". 


In  addition  to  Rapp's  coefficients,  those  of  GEM-10B  (Lerch,  1980) 
were  also  used  for  comparison.  The  "collocation"  values  follow  them  very  closely 
too  (the  three  sets  of  results  agree,  in  fact,  very  well  with  each  other).  The 
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GEM  values  are  not  shown  In  the  figure,  to  have  a  cleaner  picture.  In  any 
event,  the  "collocation”  values  compare  very  well  with  the  other  two,  and  in  the 
higher  degrees  (31  to  36)  where  the  divergence  between  Rapp's  model  and  col¬ 
location  is  largest,  GEM-10B  fits  right  in  between  them.  The  results  shown 
here  were  presented  in  a  previous  paper  by  the  author  (Colombo,  197&-b). 


Figure  3. 1 


-81 


4.  Covariances  Between  Area  Means 


In  what  follows,  the  expression  "point"  covariance  refers  to 

OB 

COV(U(P),V(P'))  =  E  c/PaiM  *  M{u(P)»V(P')}  (4.1) 

Q  3  0 

The  covariance  between  the  area  means  of  two  functions  u  and  v  is  related 
to  the  "point"  covariance  function  by  the  following  integral  relationship  (Sjoberg, 
1978): 

cov (uu,vkl)  »  777“  L  d<7  J  cov(u(P),v(P’))d o’  (4.2) 

where  P  belongs  to  the  area  block  o1Jt  and  P'  to  okl,  while 

uii  ■  f  u(P)  do  (4.3) 

Jou 

is  the  average  of  u  over  the  block  ou  of  area 

Au  =  AX(cos(0!)  -  cos(9i+A01))  (4.4) 

A0t ,  A  A  are  the  colatitude  and  longitude  spans  of  the  blocks  in  the  "row"  between 
colatitudes  0!  and  0t  +A0t.  To  simplify  the  discussion,  A0t  =A0  is  assumed 
constant;  extension  to  the  more  general  case  where  A0i  varies  from  row  to  row 
is  straightforward. 


4. 1.  Derivation  of  an  Approximate  Formula  for  the  Covariance1 

Expression  (4.  2)  can  be  computed  by  numerical  quadratures  (Rapp,  1977). 
Because  of  the  double  integral,  this  is  a  very  laborious  process,  and  it  is  not 
practical  if  many  covariance  values  have  to  be  found,  as  in  the  case  of  large  data 
sets.  A  more  efficient  alternative  is  needed. 

Replacing  the  covariance  function  in  the  integrand  of  (4.2)  with  its  Legendre 
expansion 

CO V(uljfvkl)  -  Jatj  dff  to  cu/PB(^,)  (4.5) 

Here  =  cos-1  (cos  0  cos0'  +  sin0  8ln0'  cos  (A  -  A'))  is  the  spherical  dis¬ 
tance  between  points  P  *(0,A),  and  P'  ■(•'.A')  in  the  unit  sphere,  while 

z;  ■Bi  ou  +  si  §:. 

■  *0 

is  the  nth  degree  variance  of  the  cross -spectrum  of  u  and  v.  As  shown  in  the 
Appendix,  the  order  of  summation  and  Integration  can  be  reversed. 

1  In  this  section,  expression  P8(tb»»')  is  shorthand  notation  for  Pn(cos 
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cov(u,j,Vkl)  =  — —  E  cUn’V  f  da  f  Pb(VW')  da* 

A‘jAia  a=o  Jau  JakI 


(4.  6) 


Let  P01  (cos  0)  be  the  fully  normalized  associate  Legendre  function  of  the  first 
kind  of  degree  n  and  order  m.  Applying  the  Summation  Theorem  for  such 
functions,  the  PB(&i>f)  can  be  replaced  in  the  integrand  as  shown  below. 


cov 


00  ufv  *  p  a 

(uu.Vki)  —  E  L  da  I  E  ?n.(cos0)  PB.(cos9')cos  m(A- A')da' 

b=o  2n+i  datJ  ■  =  o 

(4.7) 


The  sum  in  the  general  term  of  the  series  above  has  a  finite  number  of  terms 
(n  +  1),  so  it  is  valid  to  exchange  summation  and  integration  once  more: 


cov  | 


r(5~ij,V|tl)  =  r— t  E  5^fr  £  L  da  L  Pn«(cos  8)  Pm(cos  8')  cos  m(A  -  X')  d  a' 

(4.  8) 

Writing  out  the  area  integrals  as  colatitude  and  longitude  integrals-. 

00 

1  E 

0  *«->-■*■  iso  "Sj 

•A3+AA  i*Aj+AA 


COV(U1J,Vkl)  = 


Al3Akl  n  =  0  2n 

ek+Aek_ 


9i+A0i_ 

PB,(cos0)  sin8  d8 


J9k  9lt  P„,(cos  8’)  sin  S’  d8'  J 
9k  A} 


dX 


L 


% 


(4.  9) 
cosm(A-A')  dA' 


Calling  Aj  s  a13  ,  as  all  blocks  in  the  same  "row"  have  the  same  area,  we 
introduce  two  functions. 


and 


_  f  AAa  if  m  =  0 

m  t(2/ma)(l-cosmAA)  otherwise. 


(4. 10 -a) 


(4. 10-b) 


Then  expression  (4.9)  can  be  written,  after  integrating  and  reordering  terms,  as 
follows: 


r>  09 

cov(uu,vkl)  =  hoF(m)  E  L.,i  In.,k  cosm(ArAj) 


(4.  ll-a) 


This  regrouping  of  the  series  is  valid  because,  as  shown  in  the  Appendix,  the  series 
is  absolutely  convergent.  The  integrals  in  the  definition  of  l„,tI  (expression  (4. 10-b)) 
can  be  calculated  very  accurately  and  efficiently  with  recursive  formulae  obtained  by 
M.  K.  Paul  (1979).  Regarding  j  as  being  fixed,  the  last  expression  is  also  that  of  the 
Fouiicr  series  of  cosines  of  cov(Ui3 ,  vkl) ,  with  amplitudes 


*»  =  F(  m)  £  Ia«,t  I01jk 


(4. 11-b) 


and  phases  <0.  -  -mAj  . 


f: 

V. 


L 


-83- 
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While  (4. 11-a,  b)  are  valid  for  all  values  of  X3  in  the  interval  0  £  X}  s  2tt, 
this  egression  has  to  be  calculated  only  for  Xj  s  0,AA  ,  2AX, . . .  ,jAX.  According 
to  the  sampling  theorem  for  ordinary  Fourier  Series,  at  such  regularly  spaced 
expression  (4. 11-a)  takes  precisely  the  same  values  as  the  finite  sum 

N 

cov(uw,vitl)  =  a^  cos  m(  A;  -  Ai)  (4.12) 

■  *0 

where  N  *  h/aA 

a^1  =  E  aaNh+«  +  E  a3Nti-»  (4.13) 

h=0  h=l 

To  calculate  the  covariances,  (4. 11-a,  b)  and  (4.13)  have  to  be  truncated,  excluding 
all  harmonics  above  some  degree  Nmaxt  (4. 12)  becomes: 

N  *  Na»X 

OOV(UiJt?M)  COV(uu,Vkl)  =  ETE  E  (In^Nh+«,l)(I»^Nlx+.,k) 

wmax  .=oU=os=i 

K  N««x 

+  E  E  (  In,aNh-m,l)(  ln,3Nh-»,k)l  F(  m)  COS  m(Aj  -  X)  (4.14) 

where  (2K  +1)N  s  Nmfly  . 


4.2.  Choosing  N 


max 


To  calculate  the  values  of  covariances  between  equal  angular  mean  anomalies,  Ag, 
the  value  of  Nmay  could  be  chosen  so  that  the  percentage  error 


v  = 


covfSgtj.Sguto  -  cov(5g1j,Iacl)N 


COV(2gu  ,£gkl)q 


x  100 


(cov(Ag1J  ,Agki  )q  being  computed  by  numerical  quadratures)  does  not  exceed  a  pre¬ 
scribed  upper  bound  e  .  The  smallest  Nrpfly  that  meets  this  condition  increases 
towards  the  poles,  because  the  decrease  in  area  of  equal  angular  blocks  with  latitude 
means  that  the  averages  have  a  high  frequency  content  that  increases  accordingly. 

On  the  other  hand,  the  absolute  values  of  the  integrals  of  the  PM  ,  and  therefore 
the  Fourier  coefficients  a,  in  (4. 12),  decrease  quite  fast  with  increasing  order  m 
near  the  poles,  so  their  contribution  soon  becomes  insignificant.  This  is  fortunate, 
because  the  need  for  lengthier  calculations  for  each  Fourier  coefficient  nearer  to 
the  poles  can  be  offset  by  the  existence  of  fewer  coefficients  there.  In  fact,  oecause 
of  the  finite  arithmetic  of  digital  computers,  all  a,  for  blocks  less  than  30°  from 
the  poles  are  rcunded  off  to  zero  for  m  considerably  less  than  N  in  the  cases 
presented  here.  Because  of  this,  calculations  near  the  poles  can  be  less  laborious 
than  close  to  the  equator,  in  spite  of  the  larger  Nmav  .  To  take  advantage  of  this 
in  the  programming  of  (4. 14),  the  a,  coefficients  were  not  computed  above  the  first 
m  for  which  the  following  condition  was  met: 
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a2 


n-1  AS 

.s  H 

A  =  o 


z  6 


and 


K 

%A 


6, 


1=0 


where  6  £  lO"12.  This  ensured  no  change  in  the  first  six  significant  figures  of 
the  result  when  compared  to  the  case  where  no  coefficients  were  ignored,  but 
there  were  great  savings  in  computing. 


4.3.  Numerical  Examples 

To  verify  the  accuracy  of  the  truncated  series  in  expression  (4.14),  mean 
gravity  anomaly  covariances  were  computed  both  with  this  formula  and  by  numeri¬ 
cal  quadratures.  All  calculations  were  carried  out  in  double  precision  (32  bytes), 
some  of  those  for  expression  (4. 14)  being  repeated  in  extended  precision  (64  bytes). 
The  agreement  between  both  sets  of  results  was  better  than  6  significant  figures, 
suggesting  that  both  expression  (4.14)  and  Paul's  recursives  (used  to  obtain  the 
Ini)jI)  are  quite  stable  numerically,  at  all  latitudes. 

The  covariances  computed  were  between  a  mean  anomaly  in  a  fixed  block 
and  all  the  anomalies  in  the  same  row  of  blocks  (i.  e. ,  all  blocks  bounded  by  the 
same  parallels),  calculations  being  done  for  several  rows,  at  close  intervals  from 
equator  to  pole.  Results  for  only  a  few  of  those  will  be  shown  here,  because  they 
are  typical  of  the  rest.  Both  a  5°  and  a  1°  grid  (equal  angular)  were  studied.  The 
results  were  compared  to  those  obtained  by  numerical  quadratures,  and  by  the 
approximate  formula 

_  N»«* 

cov (Agij,Agki)  =-  C„  ’  Pn(tl)yy')  (4.35) 

n  =  0 

where  Y  and  Y'  are  the  center  points  of  and  crkl,  while 

/3  i  =  - 1 —  --1—  [  Pn^  (cos  i/>0)  -  Pn+ 1  (cos  0O) ) 

’  1  -  cos  yo  2  a  + 1 

is  the  nth  degree  Pellinen  smoothing  factor  (the  formula  is  Meissl's),  and 

Ipo  =  COS-1 1" (  COS  (  0!  +  1  )  -  COS  01  )  + 

This  formula  gives  the  covariances  of  averages  on  circular  blocks  of  the  same 
area  as  that  of  the  equal  angular  blocks  in  the  row  between  colatitudes  8t  and  0)+1 . 
These  covariances  are  used  sometimes  as  approximations  to  the  equal  angular  co- 
variances  between  mean  values. 

The  numerical  quadratures  technique  consists  in  the  following;  (a)  sub- 
do  .uiug  each  block  with  a  grid  of  k  equally  spaced  latitude  and  k  equally  spaced 
longitude  lines;  (b)  computing  the  covariances  between  point  gravity  anomalies 
at  the  nodes  of  each  subdivision  (there  are  k4  different  pairs  of  nodes  to  be  consi¬ 
dered);  (c)  obtaining  the  approximate  value  of  the  covariance  between  mean 
anomalies  as 
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(4.16) 


r 

j 

k— 1  k-1  k-1  k— 1  fJ 

cov  (Agiit  Agki)  =  E  E  E  E  cov  ( 4  g  "a  git  i) 

■  s  0  isO  ra  0  •=  0 

where  m,  n,  r,  and  s  are  indexes  that  identify  the  elements  of  each  pair  of  nodes 
in  the  subdivisions  of  au  and  akl.  The  covariances  between  point  anomalies  such 
as  Ag“  and  AgkJ  were  obtained  using  a  two  term  model  for  the  degree  variances 
of  the  anomalies: 

e**  =  18.3906-^--^  0. 9943667a+3  +  658.6132 - j~L--  o.904894SP+2 

“  (n+100)  (n+20)(n-2) 

This  is  the  "2L”  model  of  C.  Jekeli  (1978),  and  has  the  advantage  that  the  value  of 
the  point  covariance  function  can  be  computed  using  finite  recursion  formulas 
(Moritz,  1977).  The  same  degree  variances  were  used  in  (4. 14)  and  (4. 15).  The 
number  of  point  covariances  being  very  large  (k4),  a  table  with  entries  spaced 
at  A =  0.05°  intervals  was  created  first,  the  required  values  being  obtained  by 
linear  interpolation  from  this  table.  Numerical  tests  showed  that  k  -  10  was 
large  enough  for  both  5°  and  1°  blocks,  because  doubling  this  number  resulted  in 
a  change  of  less  than  0. 2%  in  the  values  given  by  (4. 16) .  Reducing  the  interval 
A^i  from  0.05°  to  0.005°  had  a  negligible  effect  also,  therefore  the  values  obtained 
with  (4. 16)  are  probably  accurate  enough  to  test  those  given  by  (4. 14).  The  only 
exception  was  the  "polar”  row,  where  the  equal  angular  blocks  are,  in  fact,  tri¬ 
angles  with  a  common  vertex  at  the  pole.  Both  with  5°  and  1°  blocks  the  dis¬ 
crepancies  between  (4.14)  and  (4.16)  were  large  (more  than  30%),  regardless 
of  how  large  a  k  ,  how  small  a  At(i ,  or  how  big  a  Nm9Y  were  chosen.  The 
probable  explanation  is  that  the  pole  is  given  undue  weight  in  (4.16)  because  it  is 
treated  as  a  whole  row,  instead  of  as  a  single  point.  For  this  reason,  the 
numerical  examples  presented  here  stop  at  the  row  immediately  below  the  pole. 

The  first  two  tables  show  the  covariances  between  5°  mean  anomalies  in  the 
row  between  latitudes  0°  and  5°  (just  above  the  equator)  and  in  the  row  between 
latitudes  80°  and  85°  (one  below  the  pole).  The  error  is  at  most  8%,  though  much 
less  in  most  cases,  and  Njuax  =  180  in  each  table.  Under  "Pellinen"  one  finds 
the  values  obtained  using  (4. 15)  with  due  regard  for  the  change  in  block  areas  with 
latitude.  While  there  is  very  good  agreement  near  the  equator,  there  is  no  resem¬ 
blance  at  all  close  to  the  pole  to  the  other  values  listed. 


I 
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Table  4.1.  Comparison  between  covariances  of  5°  mean  anomalies  Ag 
computed  with  expressions  (4. 14),  (4. 16)  and  (4. 15), 
respectively.  Nmax=  180.  Row  between  0°  and  5°. 


Expression  (4.14) 

Numerical 

Pellinen 

Block  No. 

250.53 

253.77 

251.78 

1 

148.80 

149.46 

151.03 

2 

193.93 

93.95 

93.90 

3 

57.16 

57.15 

57.10 

4 

31.80 

31.76 

31.67 

5 

13.94 

13.93 

13.86 

6 

-18.09 

-18.09 

-18.08 

12 

9.12 

9.12 

9.11 

24 

-13.68 

-13.68 

-13.65 

36 

Table  4.  2.  Comparison  between  covariances  of  5°  mean  anomalies  Ag 
computed  wllh  expressions  (4.14),  (4.16)  and  (4.15), 
respectively.  Nmax=  180.  Row  between  80°  and  85°. 


Expression  (4. 14) 

Numerical 

Pellinen 

Block  No. 

418.60 

437.23 

835.53 

1 

329.03 

318.27 

800.09 

2 

220.91 

229.17 

709.03 

3 

191.75 

196.40 

592.19 

4 

179.79 

179.70 

5 

165.58 

168. 46 

6 

120.60 

123.73 

12 

69.89 

73.27 

24 

54.66 

58.01 

149.40 

36 

The  next  three  tables  show  results  for  1°  blocks.  Here  the  numerical 
method  was  conducted  with  the  same  A0  and  k  as  in  the  case  of  the  5°  grid. 
Results  for  rows  between  latitudes  0°  and  1°,  45°  and  46°,  and  88°  and  89° 
are  shown.  In  tables  4.3  and  4.4  the  discrepancies  between  (4. 14)  and  (4. 16) 
stay  below  1%;  this  increases  to  about  5%  near  the  pole  (table  4. 5).  NffaT  is 
300  for  the  equatorial  row,  and  rises  to  400  from  45°  on.  As  with  5°  blocks, 
the  "Pellinen"  values  are  quite  close  to  these  of  (4. 14)  and  (4.16)  near  the 
equator,  but  become  very  different  near  the  pole. 


Table  4.3.  Comparison  between  covariances  of  1°  mean  gravity 

anomalies  computed  with  (4. 14)  and  (4. 16).  300. 

Row  between  0°  and  1° . 


Expression  (4.14) 

Numerical 

Block  No. 

849.68 

855.16 

1 

411.32 

410.30 

2 

220.35 

219.63 

3 

181.84 

181.50 

4 

163.33 

163.21 

5 

149.11 

149.15 

6 

1.07 

1.08 

31 

-16.79 

-16.79 

61 

1.99 

1.99 

91 

8.54 

8.53 

121 

-4.79 

-4.79 

151 

-14.39 

-14.38 

181 

Table  4.4.  Comparison  between  covariances  of  1°  mean  gravity 

anomalies  computed  with  (4.14)  and  (4. 16).  Nmax*»  400. 
Row  between  45°  and  46°. 


Expression  (4. 14) 

Numerical 

Block  No. 

952.25 

959.17 

1 

531.15 

531.97 

2 

275.55 

274.65 

3 

210.81 

210.46 

4 

185.36 

185.57 

5 

170.82 

171.08 

6 

27.79 

27.79 

31 

-14.37 

-14.37 

61 

-18.01 

-17.01 

91 

-8.27 

-8.28 

121 

-1.01 

-2.92 

151 

1.39 

1.39 

181 

18- 


Table  4.  5.  Comparison  between  covariances  of  1°  mean  gravity 
anomalies  computed  with  (4. 14),  (4. 16),  and  (4. 15), 
respectively.  Nmax=  400.  Row  between  88°  and  89°. 


Expression  (4.14) 

Numerical 

Pellinen 

Block  No. 

1137.97 

1187.94 

1795.88 

1 

1135. 79 

1184.15 

1795.02 

2 

1129.28 

1177.99 

1792. 45 

3 

1118.55 

1155.08 

1788.18 

4 

1103.76 

1131.34 

1782.25 

5 

1085. 15 

1102.85 

1774.68 

6 

408.59 

443. 12 

31 

248. 27 

257.88 

61 

207. 19 

210.64 

91 

189.65 

192.55 

121 

181.93 

184.53 

151 

179.65 

179.86 

368.60 

181 

Calculations  were  carried  out  in  the  Ohio  State  University's  AMD  HAL 
470  V/6-n  computer,  using  the  FORTRAN  H  EXTENDED  compiler,  and  double 
precision.  The  computing  times  for  obtaining  all  37  different  covariances 
between  the  elements  of  a  row  of  5°  mean  anomalies,  and  all  181  covariances 
in  a  row  of  1°  mean  anomalies,  using  (4. 14)  and  (4. 16)  are  listed  in  table  4.  6 
for  comparison.  The  integrals  of  the  Legendre  functions  required  by  (4.14), 
and  the  table  of  point  anomalies  covariances  needed  in  (4. 16),  were  precomputed 
and  stored  in  core  memory  arrays,  so  the  times  given  here  do  not  include  the 
determination  of  those  auxiliary  values.  In  most  ordinary  applications  of  these 
formulas,  those  quantities  can  be  read  from  disk  or  tape  whenever  needed, 
because  they  are  the  same  for  a  whole  variety  of  problems.  Clearly,  using 
(4. 14)  can  be  orders  of  magnitude  more  efficient  than  using (4. 16),  while  the 
accuracy  is  much  the  same.  In  fact,  accuracy  is  probably  better  at  the  polar 
rows  with  (4. 14)  than  with  (4. 16),  because  the  latter  seems  to  have  problems 
handling  triangular  blocksi  Finally,  not  all  of  the  time-saving  properties  of 
(4. 14)  were  exploited  in  the  computer  program  used  to  calculate  it,  so  there  is 
3cope  for  some  improvement  in  efficiency  beyond  that  shown  in  the  table.  Notice 
the  time  saved  in  the  88°  -89°  row  thanks  to  the  neglect  of  terms  in  (4. 14)  that 
become  too  small  near  the  poles,  as  explained  in  paragraph  (4.2). 
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Table  4.6.  Comparative  efficiencies  of  algorithms  based  on  expressions 
(4.14)  and  (4.16). 


Block  Size 

Row 

N 

1Nmax 

Expression  (4. 14) 

Numerical  Quadratures 

5° 

u* 

1 

o 

o 

— 

0.1  sec. 

23.3  sec. 

5° 

80° -85° 

■ 

0.1  sec. 

23.3  sec. 

1° 

0°-  1° 

B 

0.35  sec. 

113.3  sec. 

1° 

45° -46° 

0.43  sec. 

113.3  sec. 

1° 

88° -89° 

400 

0.06  sec. 

113.3  sec. 

4.4.  Covariances  Between  Mean  Values  and  Point  Values 


The  prediction  by  least  squares  collocation  of  mean  values  from  point 
values,  or  vice  versa,  requires  finding  the  corresponding  "mixed"  covariances. 
In  such  case,  formula  (4.2)  becomes 


cov(un,  v(0,X))  =  f  cov(u(P'),  v(P))  da 


(4.17) 


where  P'  =  ( 0',X')  e  au  ,  Ps  (0,X),  v(P)  is  the  "point”  value  of  v,  and  Ujj 
is  the  average  over  the  i,  j  block  of  the  grid,  as  before.  Following  a  similar 
reasoning,  one  arrives  at  a  formula  that  corresponds  to  (4.14),  except  that  only 
one  area  integral  has  to  be  considered.  The  new  expression  is: 


u  y  X 

cov(uu,v(e,x))  -  “Eo[hE  ng,(  2n+  l)  +  Pn,2Nh +n(CoS  ®  )  ) 


Naax 


1 

—  _  r 
+  E  Eb  (2°"  1)(In^Nh-»,i  Pn,3Nh-B(cose)j  [(A(m)  cos  m( Xj  -  X ) 

+  B(m)  sinm(Xj-X)J  (4. 18) 


where 


A(m) 


=  f 


AX 


if  m  =  0 


( (cosmAX-l)/m  otherwise 


B(m) 


I  (sim: 


0  if  m  =  0 
(sinmAX)/m  otherwise 


and  thet  ,  the  c“,v  ,  and  the  N,  K,  and  Nmax  areas  in  (4.14).  Expression 

(4.18)  assumes  that  the  area  means  belong  to  a  grid  with  constant  AX.  If  the  point 
values  are  also  on  a  grid,  and  if  this  grid  is  congruent  with  that  of  the  mean  anom¬ 
alies,  implementation  of  (4.  18)  is  quite  efficient.  In  fact,  the  speed  of  a  good 
algorithm  for  doing  this  should  be  much  the  same  as  that  of  one  for  implementing 
(4.14).  On  the  other  hand,  computing  the  same  covariances  by  numerical  quadra¬ 
tures  is  ks  times  faster  in  this  "mixed"  case  than  it  was  in  the  previous  one,  be¬ 
cause  there  is  only  one  area  integration  involved.  Assuming  k  =  10,  as  in  the 
previous  examples,  then  expression  (4. 18)  should  be  only  2-3  times  faster  than 
numerical  quadratures  for  5°  mean  values,  and  from  4  to  15  times  faster  in  the 
case  of  1°  averages. 
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Both  the  normalized  Legendre  function  and  their  integrals  are  needed  for 
(4. 18)  .  They  can  be  precomputed  and  stored  on  disk  or  tape  until  needed,  in 
this  study  the  following  recursive  relationships  were  used  to  generate  their 
values: 

P— ,.«(«.«)  -  *(■»■»  8  iu  ,->(00.  9  > 

r(2n-  l)(n  +  m  -  3)(n  -  m  - 

“L  (2n-;5)(n  +  m-2)(n-m)  J  pn-3,»-i(cos  0),  m^n  (4.19a) 
P.-i,.-i(oosfl)  =  [«]%*«  P„-v-=<co3e, 

-  ■^[■ff1i,^2+°m'.32)-]'sinsep._a>._,(cose) 


(4. 19b) 


,  (n-3)r(2n-l)(n  +  m-3)(n-m-m^re3-.  a  ^ 

n  L  (2n-5)(n  +  m-2)(n-m)  J  Je  p»-3,»-i(cos  0)  sin  0  d0  (4.20a) 


Ga 

6i 


r02-  -  _  _  1  r  (2n  - 1)  „ 

J01  Pn-i,n-i(cos0)sin0  d0  =  ^(h -  l)(n -  2)  J  sin  9  P»-3,n-3(cos  0) 


6a 


J_nn 

2nL 


-  l)(2n  -  11(2  n-  3n ®  p 02 


(n-2) 


-•)  sin0  d0 


(4.20b) 


if  0  is  not  very  small,  or 
•  0 
0! 


.03 


JeaPn-i,„<.i(cos0)  s in  0  d0  =  -|~-£|n  ~  *)(2n  ~9)‘  •  •  3~|^  sin”+10 

J0j.  ’  l_(2n-2)(2n  -4). .  ,4J  sm 

r  1  +  1  sina9  1*3  sin49  1-3-5  sin^  -| 

Ln  +  1  2  n  +  3  +  2-4  n  +  5  +  2-4-6  n  +  7  +"’J 

if  6  is  very  small.  Here  y(  0)  s  y(  0a)  -  y^) .  The  recursive  form 
the  integrals  of  the  Legendre  functions  were  derived  by  M.K.  Paul  (1978); 


(4.20c) 


formulas  for 

^  _ T _ w  _ _  <  v  tlic 

author  has  been  fortunate  enough  to  have  available  a  FORTRAN  subroutine  pro¬ 
grammed  by  Paul,  and  kindly  sent  by  him  to  Professor  R.H.  Rapp  of  the  Depart¬ 
ment  of  Geodetic  Science  at  O.S.  U.  The  results  reported  here  have  been  made 
possible  by,  and  bear  witness  to,  the  great  numerical  stability  of  Paul's  for¬ 
mulas. 
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5.  Conclusions 


The  relationships  between  spherical  harmonic  series  and  Fourier  series, 
coupled  to  the  symmetries  of  spherical  grids,  permit  the  development  of  ef¬ 
ficient  algorithms  for  numerical  analysis  of  data  regularly  sampled  on  the  sphere. 

The  algorithms  presented  In  section  1  for  implementing  numerical  quad¬ 
ratures  are  efficient  enough  to  allow  the  analysis  of  64800  1°  x  1°  mean  values 
through  degree  180  in  less  then  20  seconds,  and  the  summation  of  the  90000 
terms  of  a  harmonic  series  complete  through  degree  300,  on  a  full  1°  x  1° 
equal  angular  grid,  in  less  than  1  minute.  The  analysis  of  large  global  data 
sets  to  a  very  high  resolution  is  a  relatively  trivial  operation  with  modem 
digital  computers. 

The  principles  and  ideas  behind  the  optimal  estimators  of  section  2  pro¬ 
vide  a  rational  basis  for  the  study  of  linear  techniques  for  spherical  harmonic 
analysis,  both  optimal  and  non -optimal.  The  error  measure  introduced  in  this 
section  is  shown  to  be  a  very  reasonable  way  of  evaluating  the  estimation  error, 
as  illustrated  by  the  results  listed  in  table  (3. 1). 

The  optimal  estimators  themselves  are  reasonably  easy  to  compute  and 
use,  particularly  when  they  are  of  the  quadratures  type,  which  happens  under 
the  fairly  general  conditions  discussed  in  section  2.  Even  when  such  conditions 
are  not  present,  as  in  the  case  of  scattered  data,  the  problem  still  has  a 
structure  strong  enough  to  allow  efficient  algorithms  for  creating  and  inverting 
the  normal  matrix. 

The  separation  of  the  problem  of  estimating  the  coefficients  (by  least 
squares  collocation  or  by  least  squares  adjustment)  according  to  the  order  m  of 
the  coefficients,  allows  both  for  efficiency  and  for  numerical  stability.  Even 
if  the  total  number  of  unknown  coefficients  is  very  large,  the  largest  matrix 
to  be  Inverted  is  of  dimension  N  ,  instead  of  0(N2),  as  it  would  be  if  the  prob¬ 
lem  could  not  be  separated  in  this  way. 

All  the  algorithms  presented  here,  when  the  grid  is  complete  and  regular, 
are  well  suited  to  parallel  processing. 

In  the  case  of  full  grids  of  mean  values  with  even  noise,  the  results  of 
section  3  suggest  that  the  optimal  "collocation"  estimator  can  be  approximated 
very  closely  by  a  much  simpler  quadratures -type  formula,  the  "composite  for¬ 
mula"  (3.9).  The  search  for  simple,  near-optimal  estimators  is  just  as  impor¬ 
tant,  from  a  practical  point  of  view,  as  the  search  for  efficient  algorithms  for 
obtaining  the  optimal  estimators  themselves.  This  is  a  topic  that  certainly 
deserves  fiirther  research. 

The  methods  for  creating  and  Inverting  the  normal  matrix,  that  make 
possible  to  find  optimal  estimators  for  large  data  sets,  have  application  outside 
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spherical  harmonic  analysis,  in  all  areas  of  estimation,  filtering  and  prediction 
on  the  sphere.  This  has  been  the  subject  of  a  previous  report  (Colombo,  1979). 

It  must  be  added  that  the  principles  presented  here  can  be  generalized  to  bodies 
of  revolution  other  than  the  sphere;  to  the  case  where  the  data  are  not  homo¬ 
geneous  (i.e. ,  a  mixture  of,  say,  gravity  anomalies,  satellite  altimetry,  etc.); 
to  the  case  where  the  coefficients  to  be  estimated  are  not  those  of  the  signal  as 
given,  but  of  some  more  or  less  complex  linear  transformation  of  this  signal, 
satellite-satellite  tracking  data  being  a  good  example.  In  fact,  the  author  is 
at  present  considering  the  error  analysis  of  the  determination,  by  least  squares 
collocation,  of  the  potential  coefficients  from  satellite-satellite  tracking,  fol¬ 
lowing  some  of  the  principles  of  section  2.  This  will  be  the  subject  of  another 
report. 

The  method  developed  in  section  4  to  calculate  the  covariance  between 
two  area  means  without  employing  cumbersome  numerical  Integrations  is  of 
interest,  not  only  in  spherical  harmonic  analysis,  but  more  generally  in  fil¬ 
tering,  prediction  and  estimation  from  mean  values  on  the  sphere. 

The  computer  programs  described  and  listed  in  Appendix  B  should  help 
the  interested  reader  to  implement  some  of  the  techniques  discussed  in  this 
report.  The  author  sincerely  hopes  that  this  will  be  done  by  workers  concerned 
with  improving  and  further  developing  such  methods. 

Above  all,  the  author  hopes  that  he  has  conveyed  to  those  who  had  read 
this  far,  the  Idea  that  the  detailed  analysis  of  very  large  sets  of  global,  reg¬ 
ularly  sampled  data  can  be  done  within  the  computing  resources  available  today 
to  most  scientists  who  work  at  universities  and  research  institutions  every¬ 
where.  The  processing,  be  it  by  numerical  quadratures,  or  by  simultaneous 
adjustment,  of  "all  the  data  in  the  world"  is  not  a  fanciful  thought,  but  a  practical 
possibility. 
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Appendix  A.  Term  by  Term  Integration  of  Formula  (4.5),  and  Rearrangement 

of  Terms  in  Formula  (4.9)  to  Arrive  at  Formulas  (4. 11-a)  and  (4.14) 


The  area  integrals  in  expression  (4.5)  can  be  split  into  integrals  in  0  and 
X,  and  the  summation  theorem  for  fully  normalized  spherical  harmonics  can  be 
used  to  replace  Pn(^PP')  with  an  equivalent  expression  in  the  general  term  of  the 
series; 


cov(u1J,v!tl)  ■  - — —  |  sin0d0, 
A*iAwJet  J9t! 


(*e**Ae  rXj+AX  «Xi+ax  ® 

"  m,  r  dx ;  dx*  f  c1/  • 

^  Xj  n  =  0 


£  Paa(cos  9)  PM  (cos  0' )  cos  m(  X  -  X' )  (2 n  +  If 1 


(A.  1) 


b"0 


Starting  with  (A.  1),  the  proof  proceeds  in  four  steps,  each  justifying  in  turn 
the  taking  of  one  of  the  four  integrals  inside  the  summation  symbols.  Each  time, 
three  theorems  are  invoked: 


The  first  is  the  "M-Test"  theorem,  due  to  Welertrass  (see,  for  instance,  Carslaw, 
(1950)):  a, 

The  series  f(x)  =  4*0u(x)B  will  converge  uniformly  in  a^x^b 
if  there  is  a  convergent  series  of  positive  constants  M0,  MT, ... ,  M„ , ... 
such  that,  for  all  x  in  a^x«b,  |u(x)n|£Ma  for  every  positive 
integer  n. 

The  second  theorem  is  (also  according  to  Carslaw): 

00 

If  the  general  term  u(x)„  of  the  series  Eou(x)„  is  continuous, 
and  if  the  series  converges  uniformly  to  some  function  f  (x)  in 
the  Interval  a^x^b,  then 

p  b  p  b  «  rnPb 

;■  f(x)dx  =  £u(x)ndx  =  £  u(x)B  dx 

“■  «»  B=0  qbO  *b 

This  is  a  sufficient  condition  for  term  by  term  integration.  The  third  theorem 
is  the  mean  value  theorem  for  integrals 

If  f(x)  is  analytic  in  a«xsb  then  J^f(x)dx  =  (b-a)f(c) 
for  some  c  such  that  a  £  c  s  b  . 


Proof:  The  series  in  the  left  hand  side  of  (A.  1),  if  all  variables  but  X’  are  kept 
fixed,  is  uniformly  convergent  in  the  interval  Xt  £  X'  s  X:  +  AX  because 

n 

(2n  +  ,r  tEQ  PBa  (cos  ® )  Pn,  ( COS  0')  cos  m(  X  -  X’)  =  c„“’T  Pa  ( b, ») 

and  |cny*rPa(M  j  s  ca,r  because  max  |  Pa(i/>)  I  *  1  (the  argument  of  Pa  is  real)  for 

o  s  *  £  tt 
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ail  n,  while  E0cB*Y  =  cov(u(P),v(P))  ia  always  finite  (equals  the  value 
of  the  '^soint"  covariance  when  i i^p>  =  0  ).  The  "M-Test"  condition  is  satisfied 
so  the  series  converges  uniformly;  therefore,  term  by  term  integration  with 
respect  to  X’  is  valid: 


i  fBj+Ae  _  ^  f'0it+ae  „  (‘Art*A 
cov(utj,vkl)  »  -j— —  J  sin8d0j  sln0*  d6'j  dX  • 

hi  9j  0k  X j 


©ic  +  Ae 


rX:+AX 


•  r  o.-.’  £  5a,(coe§)  ^.(cosS')  r*1  AXcosm(X-X')dX(2n  +  l)“1 

nmO  as  0  ^Xi 


Applying  the  mean  value  theorem  to  the  last  express  bn: 


■J  »0i+A9  *»0k'«0 

cov(Un,  vkl)  «  - — sinSdS  sinfl' d0'  dX  • 

oo  a 

•  E  ca’r  y.  AX  Pbb(cos8)  PBB(cos0')  cos  m(X-Xq)  (2n  +  l)-1 

K  =  0  1*0 

where  Xj  ^  Xq  ^  Xi  +  AX  .  Removing  the  common  factor  AX  from  the  summation: 


9k+A0 


,Xj+AX 


a\  «8l+A0  <»Ok  U«  (.A. 

cov(u1,vkl)  =  — —  I  sinddd  I  8in0'  d0* 

AljAklJ0i  Jgk  ^Xj 


0k+A0 


,Xj+AX 


■  E  c»’T  E  P„(COS0)  SB.(cos0’)  cos  m(X-  X,)(2n  +  1)"1  (A. 2) 

n  =  0  r<  =  0 


The  general  term  in  the  partially  integrated  series  is 

n 

(2n  +  l)1cau’T  E  Pn,(cos0)  Pn«(cos0')  cos  m(X- X))  =  c,»T  Pn( 

■  SO 

where  Q  =  (91 ,  Xq) ;  now  |ci,T  PB<  0^)  |  £  cB’r  for  all  n,  and  for  all  X  in  the 
interval  Xj  *  X  s  Xj  +  AX  ,  so  the  "M-Test"  is  satisfied  again  and  the  series  is 
uniformly  convergent,  and  thus  integrable,  with  respect  to  X  .  Therefore 

«\3  «8l+A0  ©if+AO  <* 

cov(ut4,vkl)  =  — —  J  sin0d0  J  sin©' d0  J]  cu/  • 

*0k  as  0 

a 

*  E  P».(cos0)  PB.(cos0')  cos  m(X»  -  Xq)  (2U+1)-1  (A.  3) 

•  sO 

where  Xj  s  X,  s  Xj+AX.  Once  more  the  general  term  of  the  twice  integrated 
series  satisfies  the  "M-Test",  because 

v  n 

|(2n  +  lf1cB’  £  Pb,(cos0)  PM(cos0’)  cos  m(X«-X„)  |  *  |cB,rpB(^)|  *  cB,r 

■*o 

(where  R  =  (0,X*))  for  all  n  and,  in  particular,  for  0*  in  the  interval 
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9k  s  9'  s  9k  +  A9.  Inconsequence 


cov(u,.,vkl)  = 


AAa  sfn$j  A9  T®1  o  «»,r 

— — —  stoSdS  £_c. 


11J  ukl 
c 


“e* 


c  ■  0 


•  £  Pa,(cos9)  P.B(cos0s)  cos  m(  A*- A9)  (2n  +  lf i  (A.  4) 


•=o 


Finally, 


|(2n  +  l)-1cnu,Y  £  P„(cos9)  PM(cos0s)  cosm(A»-AQ)|  s  |cnv  FU&*)  |  ^  ca’v 
*»0 

(where  X  =  (®s,Aq))  for  all  n  and,  in  particular,  for  9  in  the  interval 
Qi  £  9  s  9j  +  AS  so  the  last  integral  can  be  put  inside  of  the  summations,  and  the 
proof  of  the  term  by  term  integration  of  (4.5)  is  complete: 


cov(uu,vkl)  -  h£  ca*v  £  ■pa,(cos9r)  Pb,(cos9s)  cosm(A»-Aq)  (A. 5) 

B  3  0  *~0 

where  9,  s  9r  s  et  +  A9  and  h  =  AAasin  63  A9  sin 6,  A9/A,JAkl 
The  general  term  in  (A.  5)  satisfies  the  "M-Test": 

a 

u  v  -  u  V 

|za|-|(2n  +  l)-lca’  £  P».(  C08  9t)  7a»(COS9s)  cos  m(A»-Aq)  I  £  cB  ’ 

■  SO 


Since  the  series  a£0  ca,T  converges,  any  series  of  positive  terms  |za|  satisfying 
|  zB  |  <  cB  must  converge  also:  the  "M-Test"  condition  implies  the  absolute  con¬ 
vergence  of  ago  zn  .  Absolutely  convergent  series  can  have  the  order  of  their 
terms  changed  arbitrarily,  without  changing  the  value  of  their  limit  sums.  This 
justifies  the  reordering  of  (4,9)  that  leads  to  expression  (4. 11-a). 
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Appendix  B :  Computer  Programs  (Descriptions  and  Listings) 


This  Appendix  contains  the  description  and  listing  of  each  of  the  major 
subroutines  developed  in  the  course  of  this  research,  together  with  their  own 
auxiliary  routines.  The  listings  of  the  main  subroutines  contain  explanatory 
comments  that  the  author  hopes  will  be  of  help  to  those  who  may  use  them. 

The  description  accompanying  each  main  program  defines  arguments,  gives 
the  dimensions  of  the  memory  arrays  required,  mentions  the  relevant  for¬ 
mulas  from  preceeding  sections,  and  gives  a  brief  explanation  of  the  various 
segments  of  the  program. 

B.  1  General  Programming  Considerations 

Only  one -dimensional  arrays  are  used  in  the  software  described  here. 

The  language  used  for  all  of  them  Is  FORTRAN  IV  ;  some  subroutines  from 
the  International  Mathematical  and  Statistical  Libraries  Inc.  (IMSL)  are  called 
by  the  main  subroutines.  Implicit  DO  loops  in  READ  or  WRITE  statements  have 
been  avoided  as  much  as  possible,  because  their  execution  may  be  rather  slow, 
depending  on  the  compiler;  instead,  subroutines  FREAD,  FWRITE  and  REWIND 
are  used  for  all  imput/output  operations  involving  large  files  on  tape  or  disk, 
in  some  of  the  subroutines.  All  operations  involving  real  arithmetic  have  been  co 
ded  in  double  precision  (  8  bytes,  or  32  bits),  which  is  equivalent  to  retaining 
the  first  7  significant  figures  in  all  arithmetic  operations. 


The  arrays  containing  the  associated  Legendre  functions  or  their  integrals, 
as  the  case  may  be,  are  arranged  first  bv  degree,  and  then  by  order  :  00,  10, 
11,  20,  21,  22,  .  .  .(N,„  ,  N„.* ). The  ££,are  arranged  accordingly,  always  in 
two  separated  arrays:  one  for  the  c£n  another  for  the  Cn\  .  In  order  to 
get  the  value  of  the  element  "nut'  from  one  of  this  arrays,  the  following  formula 
is  used; 


k  =  £n(n  +  l)+m+l 

where  k  is  the  position  of  this  element  in  the  one -dimensional  array.  When 
the  elements  are  recovered  sequentially  from  the  beginning  (00),  the  following 
type  of  DO  loop  is  used; 


KOUNT  « 1 

Do  XX  N1  =  1 ,  Natx 

DO  XX  Ml  =  Nl,  N,.x 

LEGEND  (KOUNT)  »  ARRAY  (KOUNT)  **2 

XX  KOUNT  =  KOUNT  +  1 

where,  in  this  particular  example,  the  nm  (n=Nl-l,  m=Ml-l)  element  in  array  L 
LEGEND  is  equated  to  the  square  oi  the  nm  element  in  ARRAY.  Avoidance  of  two- 
dimensional  memory  arrays  results  in  considerable  improvements  in  efficiency. 
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B.2  Subroutine  SSYNTH 


This  subroutine  computes  the  sum  of  a  spherical 
harmonic  series  complete  to  degree  and  order  NMAX  at  each  one  of  the  2N2 
points  or  blocks  in  an  equal  angular  grid.  The  subroutine  can  calculate  point 
values  (IFLAG  =  0)  or  area  means  (IFLAG  =  1)  .  The  number  of  rows  or  paral¬ 
lels  N  (Nyquist  frequency)  must  be  even.  Subroutines  FFTP  from  the  IMSL  Double 
Precision  Library  is  used  to  calculate  the  sum  of  the  series  along  rows  by  means 
of  the  Mix  Radix  Fast  Fourier  Transform  algorithm. 

The  procedure  used  is  that  described  in  paragraphs  (1.6)  and  (1. 7).  The 
symmetry  of  the  grid  with  respect  to  the  equator,  and  the  corresponding  even- 
odd  symmetry  of  the  values  of  the  Legendre  functions  or  their  integrals,  (I.e. , 
the  x“*  of  section  1)  are  exploited.  The  values  of  those  functions,  or  of  their 
integrals,  are  read  from  mass  storage  (disk  or  tape)  into  array  ROW  ,  in  the 
order  described  in  the  previous  paragraph.  All  the  values  for  one  latitude,  or 
tow;* are  read  at  once,  so  the  dimension  of  ROW  is  +  1)(N,,X  +  2).  All  the  coef¬ 

ficients  Cf,  are  also  stored  in  core,  the  cor  esponding  RCNM  and  RSNM  ar¬ 
rays  (for  C‘„  and  5*,  respectively)  have  the  same  dimension  (the  Snot,  are 
included,  though  they  are  all  zero).  The  output  consists  of  2N3  values  in  ar¬ 
ray  DATA  .  This  array  is  organized  in  rows,  from  North  Pole  to  South  Pole. 

The  rows,  of  2N  points  or  blocks  each,  have  their  values  written  consecutively. 

The  following  is  the  list  of  arrays,  and  their  dimensions: 


NAME 

DIMENSION 

TYPE 

ROW 

RD4(N„x  +  1)(N,.X  +  2) 

REAL  *8 

RCNM 

RD 

T! 

RSNM 

RD 

Tf 

X 

RD 

INTEGER  *  2 

DATA 

2Na  (N  ■  180/BLOCK) 

REAL  *8 

CR1 

N  +  1 

ft 

CR2 

N  +  1 

Tf 

SRI 

N  +  1 

»f 

SR  2 

N  +  1 

Tf 

AM 

N  +  1 

Tf 

BM 

N  +  1 

ft 

F  AUX1 

4N 

Tf 

F  AUX2 

4N 

•T 

F  IWK 

see  IMSL  Handbook 

INTEGER  *  4 

F  LL 

ft  ?f  »» 

LOGICAL  *4 

F  A 

rt  ft  r* 

REAL  *8 

IV 

N  +  1 

tf 

("F"  designates  those  arrays  required  by  the  IMSL  subroutine  FFTP).  to  ad¬ 
dition,  the  size  (in  degrees)  of  the  blocks  Is  defined  by  BLOCK;  WPP  *  i(N„»x  +  1) 
(NUI+  2);  IU  is  the  number  of  the  unit  (disk,  tape)  from  which  the  Legendre 
functions  or  their  Integrals  are  to  be  read. 

Array  X  contains  Information  on  whether  a  given  x?"  is  even  or  odd; 
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arrays  CR1  ,  SRI  ,and  CR2  ,  SR2  ,  respectively,  contain  the  Fourier  coef¬ 
ficients  (ai)  and  of  rows  i  and  N-i-1  ;  arrays  AM  and  BM 

contain  the  values  of  A(m)  and  B(m)  ,  as  defined  in  expression  (1.  7);  the 
auxiliary  array  IV  contains  the  numbers  £n(n  +  1)  needed  to  locate  individual 
elements  within  arrays  ROW  ,  RCNM  ,  and  RSNM  ,  when  they  are  not  addressed 
sequentially. 

The  comments  in  the  listing  are  probably  enough  to  understand  most  of  it 
on  close  inspection;  one  point  however  may  be  worth  explaining  further:  the 
"aliasing"  of  the  Fourier  coefficients  has  been  incorporated  to  take  care  of  the 
case  when  NM,  >  N  -  1  .  to  such  situation  the  a,  (ai)  and  S^b1,)  become 
aliased,  as  Fourier  coefficients  must,  and  it  is  their  aliased  values  that  the  FFT 
subroutine  requires  to  compute  the  values  of  the  spherical  harmonic  expansion 
along  parallels.  The  formulas  for  the  aliased  coefficients  are 

KM 

a1.  3  a1.  +  Z  (°d*N  +  a*1!*-,)  (Bl-a) 

k=l 

-  Si  +  T"  (Sa*N  -  An-i)  (B  2-b) 

where  KM  is  a  large  enough  integer.  A  similar  expression  applies  to  a, 
and  to  b,  . 

The  arrangement  of  the  output  in  latitude  corresponds,  in  the  case  of  area  means , 
to  the  intervals  on  which  the  pB  ,  are  integrated;  for  point  values,  It  is  de¬ 
fined  by  the  latitudes  9S  at  which  the  PnB  (cos  9t)  have  been  precomputed.  As 
regards  longitude,  the  grid  starts  from  the  zero  meridian  used  for  defining 
the  coefficients.  In  the  case  of  point  values  it  Is  usual  to  compute  all  values  at 
the  center  of  each  block.  To  do  this,  the  FB  ■  must  be  precomputed  at  the  lati¬ 
tudes  of  the  center  points,  while  the  longitudes  are  taken  care  by  modifying  the 
coefficients  as  follows 

(TB’t  =  CB.  cosm  ^  +  S"aI  sinm  (B  2-a) 

Sa'«  =  S’n,  cos  m  ~  -Calsinm^  (B  2-b) 

This  is  equivalent  to  rotating  the  grid  eastwards  from  the  zero  meridian  by  ^  . 


B.3  Subroutine  BARMIN 

This  subroutine  implements  either  the  algorithm  of  paragraph  (1.5) 
for  the  harmonic  analysis  of  area  means,  or  that  of  paragraph  (1. 7)  for  the 
analysis  of  point  values. 

The  subroutine  calls  IMSL's  FFCSIN  to  calculate  the  a1,  ,  b[  or  the 
by  means  of  the  Fast  Fourier  Transform  (Mix  Radix)  algorithm,  it 
also  calls  subroutine  QUADFS  ,  that  returns  in  array  A  the  de-smoothing 
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factors  .  QUADFS  calls  LEGPOL  ,  a  subroutine  that  computes  the  Legendre 
polynomials  up  to  degree  NN  +  1  needed  for  the  fT^  in  (4  . 


The  data  is  arranged  as  in  SSYNT1I  ,  in  array  DATA  ,  before  the  subroutine 
is  called.  Afterwaxds,  the  contents  of  DATA  are  destroyed,  as  the  aa  ,  ba 
or  the  a  l ,  &1  are  formed  in  place  of  them,  row  by  row,  by  FFCS1N  .  The 
resulting  coefficients'  estimates  are  put  into  arrays  RCNM  and  RSNM  ,  in  the 
same  order  as  for  HARMIN  .  The  other  arrays,  with  the  exception  of  A  ,  are 
as  in  SSYNTH  .  The  same  is  true  of  the  scalars,  with  the  exception  of  NN  . 

NN  is  the  highest  degree  and  order  to  be  estimated.  NDD  is  the  total  number 
of  Legendre  functions,or  their  integrals,  lobe  read  from  unit  IU  ,  per  . 

This  number  is  (NN+1)  +(NN+2)/2.  A  is  a  REAL  *8  array  of  dimension  NN  +  1  . 
The  dimension  in  QUADFS  allows  for  a  maximum  NN  =  300;  for  larger  solutions, 
the  dimensions  there  and  in  LEGPOL  must  be  increased  accordingly. 

In  the  case  of  point  data,  the  estimated  coefficients  are  computed  using  a 
center  point  formula  that  assumes  that  the  data  are  situated  at  the  centers  of  the 
blocks;  the  resulting  coefficients  are  referred,  nonetheless,  to  a  grid  starting 
at  the  zero  meridian  (the  "rotation"  of  the  coefficients  takes  place  between 
statements  0071  and  0073,  when  IFLAG  =  0)  .  When  IFLAG=1  ,  the  area  means 
formula  (1.30)  is  computed;  the  x™  =  na(0)sin9dO  ,  and  the  /iB  are 

those  produced  by  QUADFS ,  as  already  mentioned.  The  integrals  of  the  Legendre 
functions  are  read  from  unit  "IU",  as  in  SSYNTH  (same  format),  and  the  size 
of  the  blocks  is  specified  by  BLOCK  (in  degrees).  The  version  of  QUADFS 
listed  here  implements  the  "composite"  estimator  of  paragraph  (3. 3).  If 
another  is  desired,  this  can  be  achieved  simply  by  replacing  lines  0021  through 
0024  in  QUADFS. 


B. 4  Subroutine  NORMAL 

oc 

This  subroutine  creates  the  optimal  estimatox*s  for  the  Ca,  based  on  the 
formation  and  inversion  of  the  R(m)  matrices  described  in  paragraph  (2. 10) 

The  algorithm  exploits  the  fact  that  (C„  +  D)  is  a  block  matrix  of  Toepiitz 
circulant  sub-matrices.  This  subroutine  is  meant  only  for  mean  values. 

The  grid  is  as  in  SSYNTH  and  HARMIN.  The  symmetry  with  respect  to  the 
the  equator  is  only  partly  exploited*  matrix  D  may  not  be  persym- 
metric,  so  the  total  matrix  (Czz  +  D)  may  not  be  so  either.  Czz  however,  is 
always  persymmetric,  and  this  is  taken  into  account  to  save  computing  and  stor¬ 
age.  A  general  diagonal  matrix  D  corresponds  to  a  rather  broad  class  of 
actual  problems, such  as  the  analysis  of  the  5°  x  5°  real  gravity  anomalies 
described  in  paragraph  (3.4). 

This  subroutine  requires  four  input/output  units:  8  (read  only)  contains 
the  values  of  the  integrals  of  the  Legendre  functions,  row  by  row,  arranged  as  in 
SSYNTH  or  HARMIN;  10  contains  the  right  hand  sides  of  the  "reduced  normals" 
„kj,B  =  R(m)  2£n*  (expression  (2. 58));  15  contains  the  R(m)  matrices,  ordered  by 
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increasing  m  ,  stored  in  vector  array  form  column  by  column;  30  stores  the 
Xi*  of  the  optimal  quadratu  res  -type  estimator.  The  xi*  are  stored  from  N. 
Pole  to  S.  Pole,  and  according  to  nm  ,  as  the  Legendre  integrals  and  the  coef¬ 
ficients.  In  some  circumstances  the  grid  may  be  geocentric  rather  than  geodetic 
and  a  change  of  coordinates  might  be  desirable:  this  can  be  achieved  by  setting 
the  parameter  IGEO  to  2  .  The  flattening  assumed  for  this  transformation  is 
F  =  1/298.257. 


After  the  R(m)  matrices  have  been  created,  they  are  inverted  by  IMSL 
subroutine  LUDECP  ,  that  performs  a  Choleskii  factorization.  IMSL  sub¬ 
routine  LUELMP  solves  the  equations  resulting  in  the  Xi"  ;  if  during  the  in¬ 
version  LUDECP  detects  an  ill-conditioned  (or  a  singular)  matrix,  the  solution 
part  is  avoided,  and  a  set  of  null  Xi*  is  stored  for  that  particular  m  .  As 
an  additional  check  for  the  stability  of  the  solution,  the  relative  residuals. 

N-l 

r  v»  t 

r'  “  *-°  where  v  =  [v0  ,  vx  ,  .  .  .  v^J  is 


v  =  k*"  -  R(m)  xaa  (computed) 


(B  3) 


are  computed  and  printed.  In  all  the  cases  studied  here  these  residuals  indicated  an 
agreement  of  at  least  9  significant  figures.  To  improve  the  stability  of  the 
solution,  a  regularizing  constant  REGUL  is  added  to  the  diagonal  elements  of 
the  R(m)  (Paragraph  (3.3)). 


Arrays  PN  ,  SS  ,  and  FC  contain  the  propagated  noise,  sampling,  and 
total  error  measure  (variance)  per  degree.  W  contains  the  averaged  row 
variances  (expression  (2.43))  arranged  from  North  to  South. 


The  scalar  arguments,  NMAX  ,  NN  ,  DGRID  ,  IGEO  ,  REGUL  ,  NRUN  ,  and 
NC  2  ,  are  described  in  the  comments  inserted  between  statements  0006  and  0010. 
The  arrays  are  as  follows: 


NAME  DIMENSION 


TYPE 


ROWP  ■§ (NMAX  + 1) (NMA X + 2)  REAL  *8 

ROWQ  "  " 

RHS  i-NN  (NN  +  1)  N  (N  =  Nyquist  freq. )  " 

S  (NN  +  1)  N  " 

A  k(N  -  1)+N  " 

UL  "  " 

W  N  " 

DVAR  NMAX  +1  " 

FC  NN  +  1  " 

PN  "  " 

SS 


Arrays  ANMPQ  ,  FF,  XO  ,  B  ,  X  ,  BT  ,  FINMP  (  all  REAL*  8),  and  1DD  (REAL*  4), 
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all  dimensioned  200  or  400  in  the  subroutine  itself,  are  large  enough  for 
problems  where  N  <  200  .  For  finer  grids,  the  size  of  these  arrays  should  be 
increased  in  die  same  proportion  as  that  of  N  . 

The R(m) matrices  are  formed  according  to  expression  (2.63)  .  Subroutine 
FUR  computes  the  "aliased"  Fourier  coefficients  of  the  covariance  functions 
that  are,  in  fact,  the  elements  of  the  R(m)  ,  scaled  by  N  or  2N  ,  depending  on 
m  .  Common  MM  and  array  MT  are  part  of  a  logic  set  up  to  ensure  that  the 
Fourier  coefficients  are  not  computed  more  than  once  each. 

Subroutine  ANALYS  uses  the  Xi*  stored  in  unit  30  to  analyze  the  data 
in  array  DATA  .  The  «,1,  are  formed  in  place,  as  in  HARMIN  ,  so  the 
original  values  in  DATA  are  destroyed.  Arrays  CR  ,  SR  ,  CAA  ,  CBB  ,  SAA  , 
and  SBB  have  all  the  same  description  as  CR1  ,  SRI  ,  etc. ,  in  HARMIN  . 

IMSL  subroutine  FFCSIN  (double  precision)  is  used  to  obtain  the  a,1  ,  /3,1 .  The 
reason  why  ANALYS  is  used  instead  of  HARMIN  ,  is  the  arrangement  of  the 
X*"  "columnwise",  or  by  increasing  latitudes,  rather  than  "rowise"  (i.e. ,  all 
the  Xi"  for  the  same  i  stored  together)  as  HARMIN  would  require. 

The  listings  of  FUR  ,  ANALYS  ,  and  those  of  the  fast  input/output  sub¬ 
routines  FREAD  ,  FWRITE  ,  REWIND  ,  are  given  after  that  of  NORMAL  . 

The  input/output  subroutines  have  dummy  arguments,  because  originally  NORMAL 
was  written  to  work  with  certain  subroutines  available  at  O.S.  U.  that  may  not 
be  in  the  software  libraries  of  other  institutions. 


B.  5  Subroutine  NORMAX 


A  modified  version  of  NORMAL  ,  this  subroutine  was  created  to  compute 
the  variance  of  the  estimation  errors  in  ordinary  quadratures  formulas  according 
to  the  theory  in  section  2  .  Essentially,  it  computes 
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by  forming  and  using  the  R(m)  .  Since  no  inversion  or  solution  of  the  normal 
equations  is  required,  the  corresponding  segments  have  been  removed  from 
NORMAL  ,  and  a  new  final  segment  added  for  the  computation  of  the  various 
accuracies. 


The  theory  behind  the  calculation  of  the  using  the  R(m)  matrices  Is 
as  follows*. 


In  the  case  of  ordinary  formulas  of  the  type  (see  expression  (1. 7)) 
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6“  =  (fn“)T  m 

N_1  3N-1  * 

=  Mn  7  £  Yn“(e,X)dcr  mu 

=  L  fxi"t  cosmj&X  +  |  sinmjAxlm,j 

i—o  L  j=oHB(m)J  *-  A(m)-'  J 

the  estimator  vector  is  a  combination  of  a  "sine  1  and  a  "cosine"  vector  of 
"frequency"  m  (terminology  introduced  in  paragraph  (2.  10)).  The  product  of 
such  vector  by  (CZI  +  D)  is,  because  of  the  structure  of  this  matrix,  another 
combination  of  a  "sine"  and  a  "cosine"  vector  of  the  same  "frequency".  From 
the  pi’operties  of  the  normal  matrix  follows  that 

cos  in  j  A  X  + 

B(m)  \ 

.  W sinmi4X  • 

where,  calling 

VS\  u?*  .  .  .  *£-*]  T  =  R(m)  [xo>»  XT»  •  .  •  x“i)T 


and 

F(m) 

(  2N  AX2  if  m  =  0  or  m  =  N 

1 4N  ( 1 -  cos  m  AX)  if  m  ^  0  ,  m  ^  N 

is 

(f?/ 

m2  N'i 

(C„  +D)  f£  =  F(m)  £  X?»r 

(B4-a) 

(C„+D)J“  = 


2N  if  m  =  0 
or  m  =  N 
N  if  ra  /  0 


M(m)\ 

WBfmv 


r  a 

Regarding  the  scalar  product  2coa0!fI_fn,  ,  it  is  easy  to  show  that 

2£*a.z  f“  -  2ss%nX (x?#)3  F(m)  (D4'b) 

Expressions  (B4-a)  and  (B4-b)  are  implemented  by  NORMAX  to  obtain  the 
error  variances.  This  subroutine  also  uses  FUR  .  Subroutine  QUADRS  is 
also  called,  to  obtain  the  de-smoothing  factors.  In  the  case  listed  here,  this 
factor  is  .  Array  QUADS  has  been  added  to  the  list  of  arguments, 

and  it  contains  the  after  the  call  to  QUADFS  . 


B.  6  Subroutine  LEGFDN 

This  subroutine  computes  the  normalized  Legendre  functions  and,  if  so 
desired,  their  derivatives  at  a  given  0t  .  All  values  correspond  to  the  same 
order  M  ;  if  more  than  one  order  at  a  time  is  needed,  a  DO  loop,  where  the 
subroutine  is  called  once  for  each  order,  can  be  set  up.  The  subroutine  is 
based  on  formulas  (4.19  a-b)  and  (1.38  a-b).  The  use  of  this  subroutine  is 
explained  by  the  comments  Inserted  In  the  listing.  The  stability  of  Die  recursive 
formulas  was  tested  by  computing  PnB  (cosO)  and  (dl»,/d 0)(cos0)  for  m  --  350 
and  350  -■  n  •  400  ,  2,5°  s  o  **  90°,  at  5°  intervals.  Calculations  were  done 
first  in  double  precision  (8  byte  words)  and  then  repeated  in  extended  precision 
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(16  byte  words).  Both  sets  of  results  agreed  with  each  other  to  better  than 
10  significant  figures. 


B.  7  Subroutine  NVAR 

This  subroutine  computes  the  degree  variances  of  the  gravity  anomalies 
(point  values), up  to  degree  100,  according  to  the  coefficients  obtained  by 
R.  Rapp  from  a  complete,  equal  angular  set  of  mean  1°  x  1°  anomalies,  as  mentioned 
in  par.  (3.1).  Above  n  =  100  ,  the  subroutine  uses  a  two-term  model  to  calcu¬ 
late  the  a\  (Ag)  .  The  resulting  degree  variances  are  stored  in  array  DVAR  . 

The  first  element  in  DVAR  is  a?  (Ag)  . 
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B.  8  SUBROUTINE  COVBLK 


Subroutine  COVBLK  calculates  covariances  between  area  means  according  to  expres¬ 
sion  (4.14).  This  subroutine  is  listed  in  the  following  pages.  The  arguments  are 
explained  in  the  comments  at  the  beginning  of  the  listing.  In  addition,  the 
following  things  have  to  be  born  in  mind:  the  dimension  of  the  array  DVAR 
is  Nmax;  the  dimension  of  both  RINS  and  RINN  is  ?(Nmax+l)(NclaxJ‘2);  the 
dimension  of  COVS  is  360/BLOCK.  The  values  returned  in  DVAR  are  the 
original  degree  variances  c“’  ,  each  divided  by  2n  +  l.  If  LB  is  less  than 
360 /BLOCK,  the  LB  +  1,  LB +  2,  ...  ,(360 /BLOCK  -  LB  - 1)  elements  in  COVS 
are  returned  as  zeroes,  the  remainder  contain  the  first  (and  last)  LB  covari¬ 
ances.  The  dimension  of  F  is  180/BLOCK  (Nyquist  frequency).  To  use  this 
subroutine  with  Nmax  >  400,  the  arrays  FF  and  H>D  (whose  dimension  should 
be  no  less  than  Nmax)i  should  be  redimensioced. 

The  subroutine  does  not  take  advantage  of  the  ’’aliasing"  of  Fourier  coeffi¬ 
cients  built  into  expression  (4.14).  Implementing  this  aspect  should  lead  to 
some  additional  improvement  in  efficiency.  The  Fourier  series  is  computed, 
orcn  the  .'.-efficients  have  been  determined,  by  multiplying  each  coefficient  by 
the  i.orrnsnouning  cosine  of  mAj  and  adding  the  products  together.  The  values 
cJ.  cos  rn/v,  are  computed  using  the  following  recursive  formula: 

coa  a/j  =  2  cosAj  cos(m-l)Xj  -  cos(m-2Uj 

which  avoids  repeated  calculation  of  the  FORTRAN  COS  function  (only  cos  A.  is 
required  to  start  the  recursion).  Actual  calculation  of  the  Fourier  series  requires 
about  0.04  seconds  in  the  most  time  consuming  case:  the  grid  of  1°  blocks.  The 
greater  part  of  the  time  taken  by  this  subroutine  goes  into  finding  the  Fourier 
coefficients  of  the  mean  value  covariances.  For  this  reason,  there  is  not  much 
difference  between  computing  all  covariances  in  a  certain  row,  or  just  a  few  of 
them,  using  this  procedure. 
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